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EDITOR’S FOREWORD 


The problem of communicating in a coherent fashion the recent developments 
in the most exciting and active fields of physics seems particularly pressing today. 
The enormous growth in the number of physicists has tended to make the 
familiar channels of communication considerably less effective. It has become 
increasingly difficult for experts in a given field to keep up with the current 
literature; the novice can only be confused. What is needed is both a consistent 
account of a field and the presentation of a definite “point of view’ concerning 
it. Formal monographs cannot meet such a need in a rapidly developing field, 
and, perhaps more important, the review article seems to have fallen into 
disfavor. Indeed, it would seem that the people most actively engaged in devel- 
oping a given field are the people least likely to write at length about it. 

FRONTIERS IN PHysics has been conceived in an effort to improve the 
situation in several ways. First, to take advantage of the fact that the leading 
physicists today frequently give a series of lectures, a graduate seminar, or a 
graduate course in their special fields of interest. Such lectures serve to sum- 
marize the present status of a rapidly developing field and may well constitute 
the only coherent account available at the time. Often, notes on lectures exist 
(prepared by the lecturer himself, by graduate students, or by postdoctoral 
fellows) and have been distributed in mimeographed form on a limited basis. 
One of the principal purposes of the FRONTIERS IN Puysics Series is to make 
such notes available to a wider audience of physicists. 

It should be emphasized that lecture notes are necessarily rough and 
informal, both in style and content, and those in the series will prove no excep- 
tion. This is as it should be. The point of the series is to offer new, rapid, 
more informal, and it is hoped, more effective ways for physicists to teach 
one another. The point is lost if only elegant notes qualify. 

A second way to improve communication in very active fields of physics 
is by the publication of collections of reprints of recent articles. Such collections 
are themselves useful to people working in the field. The value of the reprints 
would, however, seem much enhanced if the collection would be accompanied 
by an introduction of moderate length, which would serve to tie the collection 
together and, necessarily, constitute a brief survey of the present status of the 
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field. Again, it is appropriate that such an introduction be informal, in keeping 
with the active character of the field. 

A third possibility for the series might be called an informal monograph, 
to connote the fact that it represents an intermediate step between lecture 
notes and formal monographs. It would offer the author an opportunity to 
present his views of a field that has developed to the point at which a sum- 
mation might prove extraordinarily fruitful, but for which a formal monograph 
might not be feasible or desirable. 

Fourth, there are the contemporary classics—papers or lectures which 
constitute a particularly valuable approach to the teaching and learning of 
physics today. Here one thinks of fields that lie at the heart of much of present- 
day research, but whose essentials are by now well understood, such as quantum 
electrodynamics or magnetic resonance. In such fields some of the best pedago- 
gical material is not readily available, either because it consists of papers long 
out of print or lectures that have never been published. 


The above words, written in August, 1961, seem equally applicable today 
(which may tell us something about developments in communication in physics 
during the past decade). Richard Feynman contributed two lecture note vol- 
umes (“Quantum Electrodynamics” and ‘‘The Theory of Fundamental Pro- 
cesses”’) to the first group of books published in this series, and, with the publica- 
tion of the present volume and the forthcoming publication of ‘“‘Photon-Hadron 
Interactions,” it gives me special pleasure to welcome him back as a major 
contributor to FRONTIERS IN PHYSICS. 

“Statistical Mechanics: A Set of Lectures” will be of interest to everyone 
concerned with teaching and learning statistical mechanics. In addition to 
providing an elegant introduction to the basic concepts of statistical physics, 
the notes contain a description of some of the many original and profound 
contributions, (ranging from polaron theory to the theory of liquid helium) 
which Professor Feynman has made in this field. 


Urbana, Illinois DAVID PINES 
June, 1972 
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CHAPTER 1 


INTRODUCTION TO STATISTICAL MECHANICS 


1.1 THE PARTITION FUNCTION 


The key principle of statistical mechanics is as follows: 
If a system in equilibrium can be in one of WN states, then the probability of 


the system having energy E, is (1/Q)e~ ®"/*7, where 


N 
Q= y 7 EnlkT 


n=1 


k = Boltzmann’s constant, and T = temperature. Q is called the partition 
function. 

If we take |i) as a state with energy E; and A as a quantum-mechanical 
operator for a physical observable, then the expected value of the observable is 


nah, +) ALS\ 97 E/kT 
<A) 0 fy “ilAlive . 


This fundamental law is the summit of statistical mechanics, and the entire 
subject is either the slide-down from this summit, as the principle is applied to 
various cases, or the climb-up to where the fundamental law is derived and the 
concepts of thermal equilibrium and temperature T clarified. We will begin by 
embarking on the climb. 

If a system is very weakly coupled to a heat bath at a given “temperature,” 
if the coupling is indefinite or not known precisely, if the coupling has been on 
for a long time, and if all the “fast” things have happened and all the “slow” 
things not, the system is said to be in thermal equilibrium. 

For instance, an enclosed gas placed in a heat bath will eventually erode its 
enclosure; but this erosion is a comparatively slow process, and sometime before 
the enclosure is appreciably eroded, the gas will be in thermal equilibrium. 

Consider two different states of the system that have the same energy, 
E, = E,. The probabilities of the system being in states r and s are then equal. 
For if the system is in state r, any extremely small perturbation will cause the 
System to go into a different state of essentially the same energy, such as s. The 
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E,=H;,+£&,=H,+E£, 


Fig. 1.1 Energy levels in a system S and a heat bath H. 


same is true if the system is in state s. Since the system remains in contact with 
the heat bath for a long time, one would expect states of equal energy to be 
equally likely. Also, states of different energies would be expected to have 
different probabilities. 

_ Because two states of the same energy are equally probable, the probability 
of a state having energy, £ is a function only of the energy; P = P(E). 

Now consider a system, S, in equilibrium with a large heat bath, H (see 
Fig. 1.1). Since experience shows that the behavior of a system in equilibrium is 
independent of the nature of the heat bath, the bath may be assumed extremely 
large and its total energy E very great. Also, the possible energy levels of the 
heat bath may be assumed quasi-continuous. 

Let the energy levels of the heat bath be denoted by H,. These levels are 
distributed quasi-continuously. Let the energy levels of S be denoted by E;. 
Then H; > E; for all i, 7. The bath plus the system can be thought of as a new 

system, 7, which i is also in thermal equilibrium. 

T has a definite energy, but as that energy is not fixed exactly (the bath is in 
contact with the outside world), we can assume that the energy may be anywhere 
in the range Ey + A. If A is sufficiently small, we can assume that the states of 
the heat bath are equally likely in the range H; + A. Let n(H,) be the number of 
states per unit energy range in the heat bath H around energy H,. 

The probability, P(E,), that S is in a state with energy E, is proportional to 
the number of ways S can have that energy. In other words, it is proportional 
to n(E, — E,)2 A = the number of states of H that allow T to have energy in 
the range Ey + A. Then 


P(E,) = n{Eo — E,) = ginn(Eo~Er)— Inn(Eo~ Er), 
P(E,) = (Eo — Ey) 
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Remember that FE, « Ep. If it is true that (d/dE) In n(E) = B(E) is almost 
constant for E in the range under consideration,* then we can say 


P(E,) = oe” B(Er— Er’) 
P(E,) 


> 


so 
P(E,,) oc e7 BF’, 


Normalization requires that P(E;) = (1/Q)e~°' where O = ¥; e7 9¥'. 
The fundamental law has just been shown to be quite plausible. But for 
those who doubt the constancy of B(E), let us consider some examples. 
First, assume that the heat bath consists of N independent harmonic oscil- 
lators. The energy of the bath is 
N 
F= ¥ njfw,, 
i=1 
where we assume that n; is very large, and we neglect the zero-point energy. 
How many states are there with energy less than F? If N = 2, we have the 
situation, shown in Fig. 1.2. 
For N = 2, the number of states is proportional to the area of the triangle. 
Clearly the number of states with energy less than F for large F is proportional 
to F% ;so the number of states per unit energy range is n(F) oc (d/dF)F™ oc FN7!, 


In n(F) = constant + (N — 1) In F, 


dlnnF)  N-1 ,N=1 


since E, « Eo. 
dF F Eo 


* The basic assumption made here is, that the system governing the probabilities has 
a quasi-continuous spectrum in the region considered and for which there is no partic- 
ular characteristic energy. For such a system, if ¢, and ¢2 are two energy values, then, 
since energy is only defined up to an additive constant ¢, we should have 


fs) _ fe: + 2) 

fe.) f@, + 8)’ 
where f(e) is the probability. Defining 

fe) = gle — &), 


we obtain 
ge)g(é, — &2) = gO)g(e, — & + 28), 


which is (uniquely) solved by 


ge) = gO)e"** ~— (B constant), 
that is, 
fe) = g7Be1-22) 


f(é2) 
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F = constant 


ny 
Fig. 1.2 States with energy less than F for two independent harmonic oscillators. 


For N large, 
1 1 


7 energy per oscillator WwW’ 


where W = energy per oscillator. 
Alternatively, we can go back to the equation 
P(E,) = NE = r) 
P(E,) (Eo _ E,) 
and plug in N directly. We get 
n(Eo = r) has (Eo os Ey"? 
n(Eo = Ey’) (Eo ~ E,)"-? 
je INWYE Ee 
(1 — E,|/NW)X-} eB l¥ 


as N — o.* 


We then get the same value for B as by our previous method. 


Asa second example, consider the case of a heat bath consisting of N particles 
in a box. 


2 2 2 2 2 
paoPat Pit Pi. Pet Pht Ph. y Pay + Pow + Pew 
2m 2m 2m 


Assuming periodic boundary conditions, 


P= 2nh(integer) 2 2n,th 
x L L > 


* Problem: Why is W independent of N? 
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Fig. 1.3. Number of states with energy less than F is roughly equal to the area of the 
circle shown. 


where L is the length of the sides of the box. Then 


2 
poo [n2, + m2 +n? + n2,4+-+>+ n2]. 
2Lm 


If there were just two 7’s, we would calculate B with the help of Fig. 1.3. 
The number of states with energy less than F is roughly equal to the area of the 


circle with radius JF (Lm/nh). For N particles, we must use a hypersphere in 3N 
dimensions, and the number of states with energy less than F is proportional to 


(/F)*". It follows that B is roughly constant, and equals 1/(2W), where W = 
energy per particle. 
Consider two independent systems S, and Sz with energy levels A; and B;. 
The probability of system S, having energy A; is 
e PA: 
ye As , 
Now place S,, and S, in loose contact with each other and consider the combined 
system S; = S, + Sg, with combined energy 7, = A; + B,. 
e ArtAit+ By) 
Py e FrAi >; e FrB; 
e bras e7 brB; 


= y; e brAi > e FrB; . 


The probability of system S; being such that system S, has energy A, is 


e Brai e7 brBy eBTAi 
pee * 5 ee yy e7 PrB; ~ di e BrAi Ad, 


P(A) = 


P(T,) = P(A; + B;) = 


J 
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Similarly, 
e Ars; 


P,(B;) = Fe PrBy 
J 


We see that if two systems are placed in loose contact with each other, in 
equilibrium they have the same 8. Temperature has a similar property, and in 
fact, by the way temperature is conventionally defined, 8 = 1/kT where k is 
Boltzmann’s constant.* 

From the basic principle of statistical mechanics, once 


= e7 EnlkT 
2-5 


is known, all thermodynamic properties can be found. We define F, the Helm- 
holtz free energy, so that 


O = Vie BHAT = g FIT, (1.1) 
F= -kTinQ = —kT In (5 eee), (1.2) 
S = entropy, is defined as —k }° P, In P,,, (1.3) 


* Furthermore, suppose our spectrum has a density of states n(e) which is relatively 
constant when ¢ —> fe, that is 


n(Be) = n(e)q(B) (*) 
Then U() (see 1.7) = C/B and we recognize in C the usual (total) heat capacity, so that 
Oi Ge 2 Gat ee 
Bi Be B 


which is the well-known (experimental) formula. 
Equation (*) has the unique solution 


n(eé) = Ae" 


which is, indeed, the form of n(e) for microscopic bodies (see pp. 3, 5). 
Also, it can easily be seen (from 1.7 below) that 


dU 1 


= -—5 D (& — ay)7e Pmt < 0, 
dp 20? im 


so that U is a decreasing function of 8. Suppose we mix two systems having different 
B’s, B, < B,. After mixing there will be a common 8, according to 


U,(B,) + U2(B2) = U,(B) + U,(B), 


whence, since the U’s decrease, 8B, < B < B,, and energy flows from the low-8 body to 
the high-8 body. This also fits into our intuitive perception of temperature. 
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where 
e” EalkT (1.4) 


From Eq. (1.2) it can be seen that 


_E 
s a = -kE E e~ Poik ae In 0} =§ (1.5) 
P = pressure = x — P, o = — (+), (1.6) 
U = average energy = ae | ON el (1.7) 
But ; 
Ue et ee gO ee (1.8) 
Q OT aT \T éT Ai/T)T 


2 
o, = (22) _ OF _ a (rt) = 2 oF BE 
4 


a2 
-_T (1.9) 


To get a clearer idea of the nature of pressure, consider a possible alternative 
definition. The Hamiltonian operator for the system is dependent on the volume. 
Set H = Hamiltonian = H(V). We can take as a pressure operator Pop = 
— d0H/éV. 


1 ~E; 
P = > <i|Poplide Ei (kT 
0% 
Our alternative definition is equivalent to the first one if 


. oH. OE, 
if — =, 
<i ay |i> AV 
But 
OE gg Ee) = EMV) tig SEHD = Ci 
OV yay Vi—vyV Voy V’-v 


where |i”) is the eigenvector of H’ corresponding to |i), the ith eigenvector of H. 
H' = A(V’). 
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H! = H + (H’ — 8H); so we can apply first-order perturbation theory to 
say that <i’| H’ |i) = <i| H |i) + <i] H’ — H |i) for H'’ - H=0. Then 
GE; <i|H’ — Hlid _ ,, 0H 


— = lim i id. 
OV vey V'—vV ay IP 


Our two definitions are equivalent. 

Because H may be a function of the shape of the system, as well as its volume, 
our definition of pressure might depend on how the volume is changed. In 
general, for any parameter «, there is a force that can be computed by 


(Force), = 2 € =) eW EaykT 


Ou 


and we can write 

OE; ., OH ,. 

— = ¢i| — |i). 

ais | li> 

A third definition of pressure is 

0U 

P = — —(V,S). 
Ay ) 


That this definition is equivalent to the other two may be verified without diffi- 
culty from the equation U = F + TS. 

The equation S = —(GF/0T), holds only at equilibrium, where F is defined. 
Away from equilibrium, S always increases with time. To see this, note that 
time-dependent ae theory gives 


=. (Veunl?Pn — |VeunlPm)s 


where |Vaml|? is the probability per unit time of transition from state n to state m, 
and |Vaml? = |Vmnl?. Then 


m7 eS (S od hee ie? “In P, 


dt dt 
since 


d 

= P; _— oe 
t dt » dt 
Then 


& = -ky [Val (Pr _ P,,) In Pry 
t mn 


= = Y Wonl (Pa a P,,)(n Pr —In P,). 
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But each term in the sum is negative, for the sign of P, — P,, is opposite to the 
sign of (In P,, — In P,). So dS/dt > 0. 

Before we start to make calculations, note that if we have a system that is a 
combination of several independent subsystems, with Ey.) = E, = i Ey, = 
>; energies of the subsystems, 


F = -kTiInQ = —kTIn Ye *™* = —kTIn YY e7 Ph Em 


Np M2, ene 


ering] (gem) =-urg[inget] = 4, 


The free energy of the whole system is the sum of the free energies of its non- 
interacting parts. 


1.2 LINEAR HARMONIC OSCILLATORS 


Consider a system of harmonic oscillators in thermal equilibrium. The partition 
function Q, free energy F, and average energy of the system of oscillators can be 
found as follows: The oscillators do not interact with each other, but only with 
the heat bath. Since each oscillator is independent, one can find F; of the ith 
oscillator and then 


M 
F=YF, 
i=1 
(M oscillators). 
QO, = Y ew Bet (1.10) 
E, = h(n + 4) — from quantum mechanics (n = 0, 1, 2,...) (1.11) 


Q; = Vi en hedns 1/2)/KT 
oo 


e 7 heoi/2kT 1 —hoif/kT — 2heo;/kT 
2 = 1 — eW fokT F — e~ hoi/kT naire ase uae | we) 
F; = —kTInQ, = 5 eT lik Die PO) (1.13) 
U; = average energy of a single oscillator in thermal equilibrium 
1 pees 0. «OF, 
= —Y Ej e FnkT al 1.14 
0, = 0(1/T) T ore 
; : hoi {kT 7 7 
_ ho, | hae a Oy FO, (1.15) 
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F=YF,=) ke +kTin (i - cto (1.16) 


ho; ho; 
U = u=3/% + Seer 


It is customary to define an average nj, n;, according to 


U; = (n, + 2)ho;. 
Thus 
.: 1 
n, = Get : 
Returning to the example on p. 3 we see that for very high temperature 
(and U) we indeed have 
Wx U, = kT = 1/8, 
which is independent of U. 
Note that the contribution to F of the ith oscillator is negligible if hw; > kT 
(except for the Aw,/2 term). At low temperatures the high-frequency modes are 
“frozen out” and do not contribute to the specific heat. 


1.3 BLACKBODY RADIATION 


In dealing with blackbody radiation, our point of view will be as follows: In a 
cavity (blackbody), there are a great number of modes of oscillation. The 
number of modes per unit volume per frequency bandwidth is given by classical 
considerations. Each mode, however, behaves as an independent quantum 
harmonic oscillator, except that the ii@/2 term is neglected. That is, E, = nho. 
We want to get rid of the Aw/2 because it leads to infinite energy when there 
are an infinite number of modes. A Hamiltonian that eliminates the ha/2 is 


Hy = 4} + aig?) — Sr. 


With the above assumptions, we can obtain an expression for the energy 
per unit volume per unit frequency. 

First we will find the number of modes per unit volume per frequency (or 
wave number). 

Assume a gigantic box of dimensions a, b, and c. The demand is made that 
the waves be periodic at the walls of the box. 


i 


number of waves/cm in x-direction 


= number of waves in box (x-direction). 
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Because of the periodic boundary condition, 


a . 
7. = n, = an integer. 
Let k,, = 2n/4,. Then 
ak, _ 
2n x? 
and similarly 
ae = Ny, chs = ny. 
R 14 
a dk, = dn,, b dk, = dn,, c dk, == dn, 
2n 2n Qn 
3 
an _ dn, dn, dn, = abc dk = (abe) Tee ah 
(2n)° (2n)3 


For each k, there are two possible polarizations. Thus, the number of modes 
per unit volume with wave number between k and k + dk is 


2d?n d*k 
= 2 : 
(abc) (2n)° 


Now w = kc where cis the velocity of light. Also, the number of modes is so 
large that the sum over the modes can be replaced by an integral. 


Pos s. — hak) ]\ 2 d?k 
am if k,T in (: exp| kT ) (Ome (1.17) 


(Remember that the hw/2 term has been omitted.) 
From symmetry, 


i 2 [ir In (1 — exp Bes) 4nk™ dk : 

V k,T (2n)° 
U 2 | hook) exp [—hak)/k,T ] 4nk? dk 
pee : (1.18) 
V 1 — exp [—fa(k)/k,T] (22)? 

Let x = haf/k,T = hkc/k,T. Then 


4 feo} —x 
U_ 8n wary f ree 


Vo (Qn) fc? |, 1-e* 
n(k,T)* ; 
= ee Fe 
2,4 
pa (1.19) 


~ 15#3¢3" 
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The preceding results can be summarized and put in a more familiar form by 
replacing Aw with Av. 
The number of modes/unit volume between k and k + dk is 
Ank? dk — k? dk _ 8nv? dv 
(2x)? nn? e 


J 


since k = 2nv/c. The average energy of an oscillator of frequency v is hv/e"”/*7—1. 


Thus, the energy per unit volume between v and v + dv is 


dU _ 8nv? hy dv 


U, dv = - <3 WRF * (1.20) 
This is the Planck Radiation Law, and is the same as Eq. (1.18). 
fu. dy = oT* 
is the Stefan Boltzmann Law. 
For T very large (kT > hy), 
2 2 
Ee Coase = om Pay: (1.21) 


ce 1+¢(hjkT)-1. @& 


This is the Rayleigh-Jeans Law. 
Let F* = F/V and U* = U/V. 
Cy = OE 40T? 
OT 
is the heat capacity of a gas of photons in equilibrium with the container. 

If an oscillator (or mode) is excited to the Nth level, that is, if E = hw,N, 
one says that there are N photons with energy #w;. Since photons are defined as 
the degree of excitation of a mode, one cannot consider a permutation of photons 
as a new state. That is, photons are indistinguishable. This point is the basis of 
quantum statistics and will be dealt with in a later section. 


1.4 VIBRATIONS IN A SOLID 


We want to find the specific heat of a solid. However, en route to the specific 

heats, we will derive important results that will prove useful in many other cases; 

chief among these results is the calculation of the normal modes of a crystal. 
The program will be as follows: 


1. Consider the solid to be a crystal lattice of atoms, each atom behaving as an 
harmonic oscillator. These oscillators are, of course, coupled. 
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2. Find the normal modes of the system. There are as many normal modes as 
there are degrees of freedom; namely, 3(AN), where (AN) is the number of 
atoms in the crystal. The modes behave as independent quantum oscillators. 


3. Given the modes, calculate F, the free energy. 


4. From F, calculate C, and any other thermodynamic quantities of interest. 
Once F is found, we can quickly calculate U, the total energy of the system. 
In the following derivation, we will calculate C, directly from U without 
always writing down F first. 


Method of Labeling 


Consider a crystal with A atoms per unit cell. For convenience, assume that the 
unit cell is a rectangular solid of dimensions a, b, c, along three mutually per- 
pendicular axes, x, y, z. Let the origin be at the ‘“‘center” of a cell. This cell can 
be denoted by the triplet (0, 0, 0). The cell to its right along the x-axis is denoted 
by (1, 0, 0), and so on. Thus, any cell can be denoted by a vector N = n,a + 
nyb + nc wherea = ai,b = bj,c = ck. If there are A atoms/cell, 34 additional 
coordinates must be given to locate each atom. Let a denote one of these 34 
coordinates. 

Call the displacement from equilibrium of the coordinate in the Mth cell 
Z,,n- (We either consider m = mass equal to | or absorb it in Z.) 

Z,,n+m is the displacement of an atom in a cell close to N; if, for example, 
there are two atoms A,, A, per cell, the displacement of A, in the x direction is 
denoted by Z, y, that of A, (in same cell) by Z, y, and that of A, in an adjacent 
cell by Z; y4y- 1 = (1, 0, 0) or (0, 1, 0), or (0, 0, 1). 


Normal Modes 
T=(/2 YP Zin = (1/2)D Zin (1.22) 


particles 
and directions 


OV 
V = V(0O) + Z, 
2 p Gas " 


gel Ps ON, ge pee 4193) 
2 «,6.N.M \OZ,~ OZg n+m)z=0 a,N“B,N+M ; 


Assume that the electrons in the crystal always have time to adjust them- 
selves to the configuration with lowest energy, even when the crystal is vibrating. 
In this configuration, there is no net force on the nuclei when the Z, y are 


Zero, SO 
( OV ) = 
0Z4N Z=0 
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The additive constant, V(0), will not affect our answers, so we might as well 


drop it. Let 
Perera ees CM. 
0Z,,n 9Z5.n+m/z=0 


a number that depends on the relative positions of the cells of the two atoms, and 
not on their absolute positions. Note that Cy = C,;,™. 

Neglecting higher orders, take 

V=4 DY CBZ nZp.n+m- (1.24) 
a,B,N,M 

For low temperatures it is not too unreasonable to neglect higher orders, 
because the separation between atoms is of the order of 1 A and at room 
temperature the vibrations have amplitude of the order of 0.1 A. But we should 
not be too surprised if experiment shows our idealization to be false. In matters 
such as the one under consideration, the general approach is to make idealiza- 
tions and then try to find corrections to our assumptions that will give better 
results. 

In order to motivate the procedure that we will use for finding the vibrations 
of a solid, let us consider the classical problem of vibrations of coupled oscil- 
lators. Let the Hamiltonian be 

Pi? 


= Lom, 


t 


+ » 3C;9:9}; 
i 
where the q} are the coordinates of the amount of displacement from equilibrium, 
P; = M;,q; is the momentum, and Cj; = Cj, are constants. To eliminate the 
constants M,, let 


a= 4VM, and Cc, =~, 
/M.M; 
ps opscrenean a 1 P'. 
04; JM; 
Then we get 
H= ae. Pi + 5 Cuaity (1.25) 
i ij 
The equations of motion are 
F oH ; —0H 
i=, =P, Pi = =- Ci). 
AS OP. aq, 2 Cit 


We now break the motion of the system into modes, each of which has its own 
frequency. The total motion of the system is a sum of the motions of the modes. 
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Let the «th mode have frequency w, so that 
gq” = e@7 i@at a 
for the motion of the «th mode, with a independent of time. Then 
waza? = ¥ C,,a%. 
7 
The classical problem of vibrations of coupled oscillators has just been re- 


duced to the problem of finding the eigenvalues and eigenvectors of the real, 
symmetric matrix, ||C;;||. In order to get the w, we must solve the equation 


det ||C,; — 76,,|| = 0. (1.26) 
Then the a‘ (the eigenvectors) can be found. It is possible to choose the a so 


that 
yaa) = 65. 


t 


The general solution for q; is 
qi = > Car. 
a 


where the C, are arbitrary constants. If we take Q, = C,e"'°“, we get g; = 
Y, 4Q,. From this it follows that 


¥ aa, = F aPalQ, = 54,0. = 0, 
Making the change of variables, 0; = >; afg;, we get H = >, H,, where 
H, = 4Pz + 40797. 
This has the expected solutions Q, = C,e7'°*. 


Now suppose we wish to solve the quantum-mechanical problem of coupled 
oscillators. Again we have 
P? bata 
or oa Ci39i9} 
eer: 


t 


H= 


where this time 


Making the same change of variables as before, we get 
0. = aq, = Y aM, 4: 


HH 
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where 
= h2 ng 
2 602 


It follows immediately that the eigenvalues of our original Hamiltonian are 
E=Y¥,(N, + fo,. The solution of a quantum-mechanical system of coupled 
oscillators is trivial once we have solved the equation 


+ 40707. 


0 = det |C,, — @76,,|| = det |C/,//M,M, — 06,)|). 


If we have a solid with 4(1073) atoms we must apparently find the eigen- 
values of a 1073 by 1073 matrix. But if the solid is a crystal, the problem is 
enormously simplified. The classical Hamiltonian for a crystal is 


H=} a ZaN ae 3 » Cu5ZanZp,.N+M> 
a,N a,B,N,M 


and the classical equation of motion is (using C4 = C;,) 


M 
Z,,N = by CapZp.n+M- 
M,p 


In a given mode, if one cell of the crystal is vibrating in a certain manner, it 
is reasonable to expect all cells to vibrate the same way, but with different phases. 
So we try 

Zan me, a, K)e7 ca nein 


where K expresses the relative phase between cells. The e!*’ factor allows for 
wave motion. We now want to find the dispersion relations, or @ = @(K). 
w7a,e'* = Dy (cia, e*%, 
MB 


Let 
Yap(K) -= 2 Cres 


(Note that y,,(K) is Hermitian. See end of p. 18). 
Then wa, = Yg Yag%g, and we must solve the characteristic equation of a 
3A-by-3A matrix: 
det |lyag — @75zgl| = 0. (1.27) 
The solutions of the characteristic equation are 
aK) = af, 
where r runs from 1 to 3A. The motion for a particular mode can be written 
ZONK) = ai{ Ke OY ge-N 


where 
Yaak” = 6,,. 


a 
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Then the general motion can be described by 


yA = y C{K) a(Kje Oy gen 


K,r Vn 


where the C,(K yv n are arbitrary constants, and y is the total number of unit 
cells. _ 

The factor 1 Iv. n is inserted to make things look nicer later, but it is not 
strictly necessary. 

Let O,(K) = C(K)e~®", O.(K) describes the motion of a particular 
mode. 


ZN = 2 O,(K)ai(K)e™ N(1/-/n). (1.28) 

It follows that 
O(K)  YZ,,war"(Kje™'%, (1.29) 

a,N 
and the Hamiltonian for the system is 
1 z 
H= en [zx +t >» CMP ro] 
1 

= p* [10,(K)? + ©? (K)10,(K))7). (1.30) 


If we consider Q,(K) and its complex conjugate to be independent variables, we 
get the same equations of motion from 


H= » [1O,(K)|? + 07 (K)10,(K)7]. (1.31) 


15 SPECIFIC HEAT OF A CRYSTAL 


We now want to find F, so we must sum over all possible modes. 

For one mode, F = kT In (1 — e~*°/*T), There is a quasi-continuum of 
K values, and for each K value there are 3A ws. Assuming the crystal has 
volume V, and assuming periodic boundary conditions, we have in the range K 
to K + dK approximately d3K/(2x)*V modes with a given p. Thus, 


3 
Fer | yA) 4 ering 4 entexoory OK 
V oe (2x) 
d?K = dK, dK, dK,. (1-92) 


fx is shorthand for 


[" (i ic 
—nija J—n/b ie 
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The reason for the limits —2/a, z/a is as follows: the factor K was introduced in 
ef N, Then efXN a efKxnxagiKynybeiKenee: nny. n, are integers. But, 


iKxNnxa i(Kx + 22/a)n,a 


e =e 


=e iKxnxa er 


No new modes are introduced if K ranges beyond the prescribed limits. 

We now demonstrate that the specific heat of a crystal determined by our 
present method agrees with experiment; that is, at high 7, C, = 3R. At low 7, 
C, > Oas T?. 

We choose to look at U rather than at F. First we note that there is a max- 
imum frequency, @y. Two adjacent atoms can be no more than 180° out of 
phase, and thus the minimum wavelength must be of the order of twice the 
atomic spacing (call it 2a). Thus Kya, © 2n/2a = n/a, and wy = wl?(Ky). 

First, we consider the high-temperature limit. For kT > hay, all modes are 
excited to approximately the same energy; that is, if there are N atoms, 


ho; ho; h 
U= a ; =z o, + kT 
» 2 » exp (fa/kT)—1 2 » » 
= U, + 3NkT = U, + 3RT, (1.33) 


if N is Avogadro’s number. C, = 3R for large T. Note that we might as well 
neglect the zero-point energy, Up. 

For very small values of T (kT « hwy), the behavior of C, can also be 
approximately determined. 

Ignoring the zero-point energy, we see that the contribution to U of the high- 
frequency modes is very small, because 


fw ar; 
exp(fw/kT) — 1 


when hw > kT. We also know that when K is zero, there are three zero-fre- 
quency modes arising from translation of the entire crystal. For small K, there 
should be three very low-frequency modes. To see how those modes vary with 
K, consider y,,(K) for low K. Since 


Yap(K) = ) Cr%e'™™, 
M 

where the C¥ are real, 

Yp(K) = >) Cee R= Yi Cp atelh™ = y,9(K). 

M M 
Also, for real (K), yj.(K) = ?pa(—K), 80 ¥ag(K) = ¥.(—K), and 
S(w, K) = det |ly,g(K) — ©76,|| = det |lyg.(K) — wall 

det llyap(— K) = od ,¢l = f(a’, —K). 
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Then, because f is an even function of K, 
of (2 
— (w*,0) = 0 
aK (w*, 0) 


Since the characteristic equation must have a solution with m = 0 for K = 0, 
f(, 0) = 0. The characteristic equation becomes 


+ higher order terms, 
a” Nek, aK, : 


0 


so that, for low w and |K] we have 


Of , 
. a; COS a; ——— 5K, OK, ) 
=IK : 
soa floor 


where cos a; are the direction cosines of K. 

In other words, w = [K|V, where V is the velocity of sound, which may 
depend on the direction of K when the eigenvalues of 4?//0K;dK;|y are not all 
equal. But, for sufficiently low frequency, V does not depend on the frequency. 
This is the Debye approximation. 

For convenience, assume that the three sound velocities are equal; that is, 
@ = VoK. 


ho, ho, 
= | 2 * pair — 4 


ho, Km ho({K) 4nK? 
+ 3 = dK; K= 
y { eholkT a 1 (2n)° 


The factor 3 before the integral is there because for each K there are three modes 
with low a. 


Ot Si i = VoK (1.34) 
= Uot xy: prone 4 = where @y = VoK max : 


K,,,, can be found by setting the total number of modes equal to 3(AN), where 
(AN) is the number of atoms in the crystal. 


Kmax 3 Kmax 3 
ae = | 3V aK? dK = Ku 


7 ‘ 
o - (2n)° 2n? 


ee \ (2n)° 


Then 


Ky = (6n7p)'/? where p is the number of atoms per unit volume. 
M Pp Pp p 
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Let 
ho Oo kTx F k°T x3 kT dx hKV 
X¥=—=-35; WOF=—; ao” do = ; Qo = —_. 
kT T h he h k 
44 Om/T 3 
Ea Vey ele is BON i. pet (1.35) 
V Vi _2n*h3V2 J, e~—1 
Here © is the Debye temperature. 
C, = 4AT? (1.36) 


where A > 17k*/10h3V3 as T/O > 0. 
So C, is proportional to T? at low temperatures. 

The T° dependence of the specific heat at low T can be made more plausible 
as follows: We assume that for fiw, > KT, the contribution to the energy of the 
@, mode is negligible and for Aw, < kT, the contribution is kT. 

The number of modes with wave number less than K is 


Kank? 4 _ K°V 
o (2x)? 2n? - 


At low T, K = @/V,. Therefore the number of modes with frequency less than 
@, is o3V/2n?V3. But ha, = kT or w, = kT/h and 


n = (number of modes with frequency less than w,) = k°T?V/2n?V3h°. 


The energy varies as 


373 4mn4 
UC... =e fe (kT) = ai a 
Vo 2n*hPV5 an hV 5 
And thus C, is proportional to T?. The numerical factor is, of course, not 
correct. 
If the mode with frequency «, is excited to the nth level, E; = fiw,(n + 4), we 
say that there are n phonons of frequency w, and energy Aw, in the crystal. 
Figure 1.4 shows the general form of the specific heat of a solid as a function 
of temperature. 


Example: Assume the unit cell to be a cubic lattice with one atom per cell 
(Fig. 1.5). Each atom behaves as an harmonic oscillator, with spring constants 
k, (nearest neighbors), and k, (next-nearest neighbors). This case is fairly simple, 
and we can simplify the notation: « = 1, 2, 3. 


ZiN = Xy, Z2N = Yn, Z3,N = Zn. 
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3RR SS ee ee — 


—» T/8 
Fig. 1.4 Specific Heat of a Solid. 


We wish to find the three natural frequencies associated with each K of the 
crystal. To do this, we must find CM, and then y, ,. In complex coordinates 


v=) Ve 
«8 
where 
Vig = C4ZINZ6.N+M- (1.37) 
N,M 
For example, 
Vy = 2 Ch XnXnim- (1.37’) 
N,M 


If we express the displacement of atom N from its normal position as Xy, then 
the potential energy from the distortion of the spring between atoms N andM is 


M 2 
3Kmu | (Xn — Xnim)° —— |] 5 
f | 


Fig. 1.5 Cubic lattice with one atom per cell. 
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Ky = K, for N + Ma nearest neighbor to N, Ky = Kz for N + Ma next- 
nearest neighbor. 

In summing over N and M to get the total potential energy we must divide 
by two, for we count each spring twice. If we use complex coordinates, however, 
we multiply V by two to get the correct equations of motion. 


val S Kylie — Xue 24", (1.38) 
> he N N+M iM| ; 
1 M 
Vis eee >a Ku —# “(xt a Xyim\Xn ats Xy+m) 
2 .N.M |M| 
1 Mx\ * * * * 
a 5 ye, Ku Ky i [(XnXny + XNamXnim) — (XNXwim + XN+mXn)] 


Ky (M2) [xhxw — HEX wan: (1.38’) 
NM i 


Comparing Eqs. (1.37’) and (1.38’), we see that 


2 
& M40 _ _ pelt’ 
“> Ku (a) cut? — —K,, Ga) (1.39) 


Here (M,/|M|)? = 1 for M = (+1, 0,0), $ for M = (£1, +1, 0) and M = 
(+1, 0, +1), and zero for the other nearest and next nearest neighbors. So 


C?, = 2K, + 4Kz, C#{% = —K,, and soon. 
In this way, all the CM, can be found. We can then calculate 


Ya,p(K) = py Coles]. 


We wish to solve det |y,, ~ @76,,,| = 0. 
For each K, there are three solutions for w. Thus we obtain 3N values of 


wo, w(K). 


1.6 THE MOSSBAUER EFFECT 


If a free, excited nucleus goes to its ground state by emission of a photon, the 
energy of the photon will be less than the excitation energy because the nucleus 
recoils. But if the excited nucleus is in a crystal, there is a finite probability that 
it will emit a photon with the full excitation energy. In other words, there is a 
finite probability that the state of the crystal after radiation will be the same as the 
initial state. Similarly, there is a finite probability that the crystal state will be 
unchanged by absorption of a photon. This effect, called the Méssbauer effect, 
can be discussed in terms of modes of oscillation of a crystal. For example, 
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suppose the temperature is absolute zero. Then the crystal must be in its vibra- 
tional ground state before emission of a photon. We will find an expression for 
the probability that the crystal is in its ground state after emission. 

Let R be the position of the excited atom, and let P be the momentum of the 
emitted photon. We will assume, without proof, that the amplitude for the 
crystal being in a given final state is 


a = (final |Ae'?’*/*| initial). 


Furthermore, we will consider only the vibrational states of the crystal, and will 
neglect any effect due to the fact that the nucleus changes state. 

Let the Mth atom be the one that emits the photon; let Roy be its mean 
position, and let Z,, be its displacement from that position. 


R = Rom + Zm- 


We can take the origin of our coordinate system at the mean position of the 
excited atom. Then Ry, = 0. 
A single one-dimensional harmonic oscillator in the ground state is described 


by the wave function 
1/4 
w=(2) eo oar/ny, 
mth 


where Q is the position measured in units such that the mass can be set equal to 
unity. The wave function of a crystal in its ground state is a product of the wave 
functions for each mode of vibration. 


1 
Werystal = Il ey is 7 OK) O7(K)|/28 


K,r nh 
We wish to calculate: 


is Il | | Weve. Wea 


where F is the probability that the crystal will remain in its ground state and C is 
a constant independent of P. As P — 0, F must tend to unity. So, if we can get 
a formula for F without the correct constant in front of it, the constant can be 
determined easily. We will ignore such factors as “A” and ‘“(w/2h)'/*.” 

We take Z, 1 = Z,9 as the displacement of the nucleus that emits the 
Photon. For simplicity, assume the momentum of the photon is in the « direc- 
tion. Then we wish to compute 


eee iI (n exp espe) (exp [iPZ,,o/h]) 
x (1 exp oe =) 


2 


(1.40) 
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But 
Z.0 = ¥, 0(K)a(K) +, 
K,r Vn 


Kye) 


F'? & T] || exp —— oer a exp [ (~. 7 -)o O,(K)a’ | dQ 06)| 
K,r 7] 


_ Pp2t,r 2 2) Ar 
oc TL exp | PO] = exp [yp Peer 
Kir 4ho(K)y ~ £24k wo(K)n |" 
Notice that 


«(Zan = (z aa }*0,(K )O2(K eK KIN :) 


K, 
K’, 


bel 


’ 


=>y h\a,(K)|? 


2 2 
= Die a,(K)| (OH) Be oneo™(K) 


So we can write 


F = exp E ie) Zim). (1.41) 


The value of F can also be found for nonzero temperature. If “‘i” is the 
number describing the state of the crystal, we must compute 


F, = (ile?! |p|? 


and then 
ok i a 
04 
eB = TI [<n exp ate 0.4m) 208) In>| exp [=e + per) 


where |) denotes the nth state of a single one-dimensional harmonic oscillator 
with frequency w(K). As 4 is very large, we can write 
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1 P? r eee 7 

1 ge ORM? + | 
hat(K) 
x exp | ~ (n + 4) —— iT | 


_ 1 PPQn + Dh 
2 nhi?20'(K) 


"(K 2 
|a,(K)| BET 


x fo + Iho K ] 


P?(2n + 1) 2 —(2n + Iha™(K) 
F 1- "(K I a Nd ACA 
« EM] satiate iy P| a 
© P?(2n + 1), 42 —(2n + 1)iiw 
— _— ko is poet Na 
I » P 2anh [acl 2kT 
1 + e@ RolkT 
1—- r2 pan 
“al 2wn A ail {ie tt 
f 1 + e ho/kT 
— E [aa ee | oer 
As before, we get for our answer 
F= op | (Za >|. (1.41) 


1.7 QUANTUM STATISTICS FOR A MANY-PARTICLE SYSTEM 


Consider a system of N identical particles, and assume that there is no interaction 
among them. Any two configurations of the system that differ only by an inter- 
change of two or more identical particles are regarded as one and the same state. 
Thus the state of the system is determined by giving the number of particles n, 
with energy ¢,. 

Our problem is to calculate the partition function Q with the condition 
xn, = constant. The values admitted for every n, may be 


(a) n, = 0, 1,2, 3,4,... Bose-Einstein case 


(b) n, = 0,1 Fermi-Dirac case 


Bose-Einstein statistics must be used for particles with integral spin (for 
example, He*) and Fermi-Dirac statistics for particles with half-integral spin 
(such as, electrons). For Bose particles, any number of particles may occupy a 
given state. For Fermi particles, however, there can only be one particle at most 
in each state (Pauli exclusion principle). 
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A state of the system is described by the set of numbers 7,, which can take on 
any set of values allowed both by the statistics and by the condition >, 2, = N. 


O= > -exp (-# Y nts) (1.42) 


Ni,N2,--. 


If there were no restriction on the number of particles, we could write 


o=I] (z ome), (1.43) 


and we would have in the Bose-Einstein case 


@= (jax): 


In the Fermi-Dirac case we would have 
Q= Il (1 + e7 F), 


Unfortunately, Eq. (1.43) is incorrect, because we have an auxiliary condition 
that ), 1, = N = constant. With this auxiliary condition, the problem of 
finding Q becomes much more difficult. It is possible, however, to get around 
the restriction by considering the system of particles to be a box connected to a 
large reservoir of particles (Fig. 1.6), and by assuming that it is possible for 
particles to pass to and from the reservoir. 

We further assume that the statistical mechanics of the total system acts as if 
it took energy p to remove a particle from the box to the reservoir. yp can be 
adjusted by, say, a voltage regulator. In the case of electrons in a metal, for 
example, y is the work function of the metal. In general, increasing yp will in- 
crease the expected number, (4), of particles in the box. If we can find (NV) as 


Fig. 1.6 System of particles considered as a box connected to a large reservoir. 
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a function of y, we can in principle select » so that any desired number of 
particles are in the box. 
For a given p, the energy levels of a particle in the box become ¢, — y, rather 


than &,. 
By the key principle of statistical mechanics (as applied to the box), the 


probability of the gas having energy E = >, n,(¢, — ) is proportional to e~ *. 
= YY exp —f[) xe, — »)]. (1.44) 


Ni,N2,-.- 


The summation can be made without restriction. We will now show how p is 
determined by N and how thermodynamic quantities depend on O™ or g = 


—1/B1nQ™. 
e~F9 = Qo — y exp (-# b n{&q - )) ; (1.45) 


M1,M2,+-.- a 


Let N = > 1, 


=>8 (x "| exp (-6 b Nga — “)) 


= ) BN exp (-6 b NgfEq — »)). (1.46) 


But 
<N> = rc YN xp (-#|¥ n{& — »)). (1.47) 
__1 12, wi -% 
P= 50% on pon? an 


This equation gives (N) = (N(u)). Inverting it gives as a function of (N).* 
Similarly, let (n,) be the average value of n, for a given y (or a given (N)). 
Then <n,) = €g/f, and <N) = DY, <n,>. 

For the purpose of computing the probability of a state, we assumed that 
the system acts as if it had energy E = ¥,1,(e, — py). But the y was inserted 
Only in order to provide for the weighting factor that accounts for different 
Probabilities of different numbers of particles in the system. For such purposes 


* Problem: The function 


{wes 


v ehlev— w) —1 


has many plus-minus infinity points, and therefore the equation f(u) = N has many 
Solutions. Why do we still talk about a unique chemical potential for bosons? 
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as computing the pressure, we do not consider p to be part of the energy of the 
system. 


Ge ; > (x nate) exp \-8 b ng(Eq — wl} = a + yiN>. (1.48) 


Pressure = P = a Mea (5 y nate) exp \-8 b n{Eq — al} ; 


can be treated as a variable or as a function of Vand (NV). If we treat it as an 
independent variable, so that g = g(V, pu), we get 


U 


“=const. 


If we treat p (and g) as functions of V and (N), that is, with g = g[V, n(V, <N))], 
then 


—0g Ou 
eV <N)> =const. ‘ > ov 
is easily shown. Also, 
0 0 
28) een |, 
ON |y OXN>|y 
If F = F(V, (NY) = g(V, WV, <N))) + (NU, CNY), then 
—OF OF 
= ~— and w= > . 
ov <N>=const. d<N> V =const. 


Entropy =S=-k YO P In P, 


N1isM2, eae 


N1M2, 2 we M1,NQ, 046 


where 
1 
Pais ast ow exp E oy ng{Eq o | . 


It is easily shown that 


at 
oT ‘W Leoiat. 
In summary: 
g= o In Q®) = i In ¥ exp \-8 bP nf — ol} ; (1.49) 
<n = ce (1.50) 
08, 
(Ny = 99 = ¥ dn, (1.51) 
Op a 


1.8 Evaluation of integrals 


yal ¥ (x rte) exp | al ee wt x EAs LENS, 


7 ORE: op 


p= 5 5 (4 Ena)ew{-0] mc - vo]! 


a 


HV 


29 


(1.52) 


(1.53) 


(1.54) 


Soon we will be able to find g for an ideal Bose gas, and then for an ideal 


Fermi gas. But first we must evaluate some integrals. 


1.8 EVALUATION OF INTEGRALS 


We will soon be dealing with integrals of the form J®,, e~*”’ dx and J®,, x?e~*” 


dx. Let us pause for a second to calculate these integrals: 


ro) 2 roe) foe) 
I ee ax] { er ax | e” dy 


2% fo = 
=i | | e~"(—2r dr) dO = [Vx]. 
0 Jo 
| e* dx = Vn. 
| eran = Al e” dy = Vn 
mt les cP 
me _» da 
= ANE yg 3 
da Va 
0 (odd function) 


——, 
8 
8 
x 
we 
a 
( 
a 
% 
au 


ioe) 4 - d? foo) * 3 at 
we ie x? e- dx = 3a75/?/z, 


2 
Ss da” J_w 


and so forth. 


ia) fea) 2n 00 
| | e +9) dx dy = | | e" r dr dO 
— 0 -2 i) i?) 


(1.55) 


(1.56) 


(1.57) 


(1.58) 


(1.59) 


(1.60) 
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1.9 THE IDEAL BOSE-EINSTEIN GAS 


From Eq. (1.45) we have 
e fa = YY exp {—B[n,(e, — w) + noe. — w) + °° iS} 


N1,M2,.-. 


= Yexp [—Bny(e, — )] Lexp[—Bnle.- w]--- 1.61) 
For a Bose-Einstein gas, n, = 0,1, 2,... 
Serer) ce iam (Bose Einstein), (1.62) 
= 1 
ee = J] —_____ (1.63) 


{1 — eT hleimK)? 
= > Ind — e7 Ferm), (1.64) 


Consider a particle contained in a box but otherwise free. The number of 
modes in the box with momentum in the three-dimensional region d3p is 
s(d°k/(2x)°)V, where V = volume and s is the number of possible spin states 
(e.g., § = 3 for spin 1 when the rest mass does not vanish; s = 2 for spin 1 when 
the rest mass vanishes, as for photons). The energy is 

p? hi2k2 


—&=— = — 


2mm” 
Here we have approximated the discrete set of modes by a continuum (which is 
the case for a completely free particle). The sum for g can then be replaced by an 
integration: 


1 e~ 2 a3 Pp 
= s— | In(1 — e7FP?/2mebu V 1.65 
g al ae (1.65) 
N 10 e7 Fp? /2m Bu 3 
ig ce AS ia eal 2 (1.66) 
V V on 1 — e7 FP?/2mebu (27h) 


For computational purposes, set « = e?" and x? = Bp?/2m. Then p? = 2mx?/B 
and dp = (2m/B)'!? dx. 


2 i 3/2 
1 : dyes 4zp a _ : , 2m ji: 
aie (22h) 2n*h° \ B 


= icemlaee G) | 
1 2m x2Sqe7? 2,- 2x2 
ey” ||» {ae"* dx + ae at} 
+ 


mkT\3!2 ot? a3 
= 5 ea E 3372 + 33/2 = a | (1.67) 


> 
| 
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where we have used [®,, x2e7®? dx = (/n/2a)(1//a), from Eq. (1.58). Letting 


Lo=-y 5, 


kT\3/2 
= (sa) £50. (1.68) 


we obtain finally 


Thus, given p, we can find « = e* (in principle) by solving the equation 


2nh?\7'?_ p 
€3/2(0) = A =) T 3/2 : 


For the total energy of the system we have 


ie gD Sn 


= thee fi 0 y 2? 
“1 = en FP2mebe  (2nh)3 


= s§kT we (ey VL5/2(0). (1.69) 


For very low p, high T, or both, p/T*/” is very small; then C3,2(«) is very small, 
and thus @ is very small. In this case €5,(%) ¥ a, €5).(%) © a, so that 


€5/2(%) ~1 
C3/2(%) 
and U/V ~ (3/2)kTp. This condition therefore represents the classical limit. 


Now let us look at the other limit. As T becomes lower and lower, with p 
fixed, ¢, j2(a) becomes larger and « approaches 1. For « > 1, ¢ diverges. 


1 1 
Csa(l) = 1 + 3 + gaa t 11 = 2.612, 


1 1 
SspD=l= sat pat = 134i. 


The temperature T, at which « = | is called the critical temperature for 
Bose-Einstein condensation. 


3/2 
mM ap (mr) (2.612), (1.70) 


2 2/3 
Se ee. (1.71) 
mk \2.612 
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The question arises: Why does our analysis break down at T,? The answer is 
that for such low temperatures (T < T.), we cannot replace the sum, Eq. (1.64), 
by the integral, Eq. (1.65). 

Looking back at g and <N)> before we approximated all summations by 
integrals, we recall that 


ee = In (1 — e7 FeiehH) = *y In (1 — eFax); 


= — Bei 
Ni a mgs a a 
Mm TL — eo Fig = Ha teh — 1 “FT eT Pugh _ 

For small a, e~# is large; the terms with the lowest ¢; do not contribute 
much to the sum, and we can replace the sum with an integral. When e7 * is 
small, we cannot replace the sum with an integral because the first few discrete 
terms in the sum are important. Now 


0g 1 
(Mad = 95 = grt) = 1" 
Because (n,) is positive, (¢, — ) must be greater than zero. For (N) to be 
larger than its value at T = T,, (€, — p) must be positive, but very small, in 
order that the low-energy terms in (<N) be nonnegligible. Assuming no acci- 
dental degeneracy in the lowest level (that is, assuming &9 # &,) we can conclude 
that, for sufficiently low temperature, 


1 1 
<m) = elei-w) _ s eho) 4 = No). 


Without any loss of generality, we may take &) as the zero of the energy. Then 


1 1 kT 
Noy = — = KT Inf 1 t+ ~a— for large ‘ 
<Nno> oon | BL ( a diigd ge <No»> 


For low temperatures, p is very close to zero, so for the energy states above é,, 
we can neglect ». The sum for <N — no) can be approximated by an integral: 


— 1 d>p 1 
Nexc = (<N — ty = — x sV = ; 
€ o» 2 - ee eP {2mkT __ 1 


mkT\3/? T\3! 
2.612 V = CNY[(— : 
ae sree (7) 


We see, then, that our definition of the critical temperature is such that when 
T < T., there must be a nonnegligible fraction of the particles in the ground 
state: 


No) = <N> aa Wise 
= (NOU — (7/T.)*"71- 
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Cy Cy 


T. 217° K 
7” -T ba > 7 
Bose-condensation Liquid helium 


Fig. 1.7 A Bose-Einstein gas around T, compared with liquid helium around the 
lambda point. 


For 7 < T. the total energy is 


kT\3/2 toa) 
U = 37s (™ Veg (1) = 8kT Nexo 5? 
3 (5 7 fsa) = ART Nae SS 
= 3KTN,x-(0.5134) 
T 3/2 
= 3kT (0.5134) (7) <N). (1.72) 


Then Cy = GU/6T o T*!? at sufficiently low temperature. 

For T > T,, we have <N) = (mkT/2nh?)?"€3,.(a)sV. Using the definition 
of T., we have €3/.(a) = 2.612(T,/T)*/”. This equation determines « from (N) 
and T. Then 


C5 /2(a) 
= 3kTCNy 32, 1.73 
wy €3)2(%) : ! 


The behavior of the specific heat of a Bose-Einstein gas around T, is rather 
Similar to the behavior of liquid helium near the so-called “lambda point” 
(A point). See Fig. 1.7.* For a given p, T. and A are very close to one another 
(the lambda point is a few degrees K). Because of the mutual forces between 
Particles, liquid helium is certainly not an ideal Bose gas, but perhaps part of the 
explanation of the lambda transition involves Bose condensation. 


*For a calculation of the discontinuity in (€C,/éT), at the transition point, see 
s D. Landau & E. M. Lifshitz, Statistical Physics, problem of §59, Pergamon Press, 
59. 
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A more exact description of the behavior of liquid helium near the lambda 
point is given by the empirical formula: 


C. & a+ bin|T — Ty}, T < Th, 
EO le abla IP Hh. TS 


The explanation of this behavior is left as an exercise for the reader. If successful, 
publish! 


1.10 THE FERMI-DIRAC GAS 
For a Fermi gas, we proceed exactly as for the Bose case up to the point 


ef = > e~ Bmi(e1- 4) y e Bn2lea-W) 2. (1.74) 


ny nz 
Here n, = 0,1; 2, = 0,1; and soon. Thus 


e7F@ = TT (1 + en Mem) (1.75) 


© 
ll 


= @P In (1 + 7B), (1.76) 


Once again, we can approximate the sum by an integral. This time there is no 
danger that a sizable fraction of the particles will be in the lowest state. Assume 
that the gas is of electrons, so that s = 2. 


1 ~ B(p2 2 a p 
=- In (1 + e7 B(r*/2m~n) V~, 1.77 
f (6) | nh) me 
— B(p?/2m) Bu 3 
pee eee e d =e rl (1.78) 
V V ou 1 + e7 AP?/2mobu (27h) 
e7 Blea~#) 
<n) = 2 = : (1.79) 


6, 1 + e” Ba- » ef Ga- #) + 1° 


The form of <n,» for T = 0 and for any T > 0 is shown in Fig. 1.8. Now at 
T = 0, (n,> = life, < w; and (n,) = Oife, > p. Let p = po at T = 0.In 
other words, at T = 0 all the states with energy less than py are occupied, and 
all with energy greater than po are empty. fy is called the Fermi energy and j19/k 
the Fermi temperature. 


_ (0, ( dnp? _ 7 (4nPo\ 1 
a [. (ors) ap 2/ 3 lene ae) 
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na T=0 14 T>0 


Fig. 1.8 Occupation of states at T = O and T > 0. 


Se) 


if 
My e=p*/2m 


Fig. 1.9 The density of states as a function of energy. 


where fig = p2/2m. For most “reasonable” values of fg and T, po > kT (for 
example, jio/k for copper is 82,000°K*). 
Since 
2 3/2 
> 4np ap 2: 4n(2m) WP Aes 
(2nh)> (2nh)° 


the density of states with respect to energy is 


4n(2m)?/? de 


f@® = (nh)? 


This function is shown in Fig. 1.9. 
U _ 1 (afg By eee Wey 
u=— = —[-= 4+ yN)= : 1.81 
V (4 . ) Gr 1 + e A(r2/2m—w) (2nh)3 ORD 


* Also, in states of very condensed matter, as in a neutron star, 4o/k of the neutron, 
proton, and electron liquids are ~10!° — 1011°K whereas T ~ 10®°K. 
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At T = 0, 
bee P 4np? dp 4xn 1 Be ay 
cis \ (7) 2/ (nh? )- (Qnh)) 2m- (i pap ) 
An 1\1 35 
= (nh? Gals 5 Po> (1.82) 


SHo. (1.83) 


ae a 
ao \esrlen) 5) _3(0). 


p An ZS 3\2m 
(2nh)? 3° 


We will now tackle the case of finite 7. For T 4 0, what is p and what is the 
specific heat, etc.? 


ZAKI os TOU e Meme 2d p (1.78) 
V V op 1 + 7 AP7/2mobu (27 h)3 ” 
2 — B(p?/2m) Bue 3 
“ (p*/2m)e ef" 2d>p (1.81) 
1 + e7 AP?/2mobu aay 


For convenience, we wish to express these integrals in terms of the energy e: 


e = p/2m, J2me=p, dp = Vm)2« de. 
dp becomes 4np? dp de = 2n(2m)?/?/e de. Thus, 


of J 4n(m)*? 7 a, = a vede 84) 


0 FM) +1 (22h)? 9 eM 4 1 
and 
foe) 3/2 oO 3/2 
u= : i mem elde=aq| —~—% ge , (1.85) 
FEM + 1 (20h)? o PE 4 I 
where 
_ 4n(2m)?? 
(Qnh)> ~ 
In both cases, the integral that must be evaluated is 
1a [7 _s@ae 
5 efe-H) 4 4 , 


where g(e) = aVe for p, and g(e) = ae?/? for wu. Then 


°  g(e) de °  g(e) dé B #  g(e) de 
I= — a de— | ————_.. 
| ee) 4 | | PEO +1 Sy ate) 5 e Fem 4 1 


In the first term let x = B(e — yp), and in the third term let x = — fle — p). 


= g(u + x/B)dx _ (glu — x/B) dx 
r= [ae ae + |” Sa B {. aa a (1.86) 
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For what we have called “‘reasonable” values of p and T, not much error will 
arise if we replace g(u + x/B) by g(u) + (x/B)g'(u) and ff" by fo. Then 


: a ge) de + 2 el ey aie ik ge) de+ 79 187) 
0 


6p? 
° x dx 
o etl 
can be evaluated by expanding xe~*/(1 + e7*) ina power series, then integrating 
and summing; the result is 27/12.) So, 


(The integral 


I - |" g(e) de 2 ; map? g'(u). (1.88) 
0 


Remember that for J = p, g(e) = ave, and for J = u, g(e) = ae?!” 


p= al" Vede + = ery 4. 
si 
7) 3/2 4 3. 
ay Geiae = (T) (1.89) 
(3 py 
But we have also p = (2a/3)u3/* from Eq. (1.80). Thus 
n k?T? 

eta © 1.90 


ye 2 ae 
ual o? de + © aT)? Op 
0 6 2 
> es 5 34 
=a ET) 
rie 5 | ) 5 NH 
2 s2 5/2 a (kT)? an? 2/,. 
a —a — + — (kT J 
5 “Ho Ho 12 ue 7 (KT °C Ho 


2 nies 2 os 
= Uy + Vio kT)? = ug +9T?; y= “EN Ho kK. (1.91) 


u =U + yT?; =wW = Uy + yVT? = Uy + y'T? 
6U 
Cy =< = 2y'T, 
v ar ? 


Cy of a metal =-2y'T + «T? atlowT 


elect. lattice 
vibrations 
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At very low T the »’T term dominates, and the contribution of the electron 
gas to the specific heat is noticeable. 

C, of a metal = 2y’T + 3Rat high T. Since the part of the specific heat due 
to lattice vibration is constant, the y’T term is again detectable. 

Note that at very high 7, we have to include higher terms in the expansion 
of g(e), and the part of the specific heat due to the electrons is no longer propor- 
tional to T. 

We can demonstrate more easily than we have done above that the internal 
energy is proportional to 7”. At any temperature less than the Fermi temperature 
almost all the states with energy less than the Fermi energy are filled, and almost 
all states with energy greater than the Fermi energy are empty. As the electron 
gas is raised to temperature T from T = 0, the average energy than can be im- 
parted to an electron is about kT. 

Also, only those electrons within about kT of the Fermi level will be excited, 
because electrons with less energy have no place to go—the states are filled. The 
number of electrons with energies between po and po — kT is 


Ho ; 
N= as'!? de, 
uo-kT 


For fg > kT, N is proportional to kT. Thus the internal energy added is pro- 
portional to (kKT)kT = k*T?. 


CHAPTER 2 


DENSITY MATRICES 


2.1 INTRODUCTION TO DENSITY MATRICES 


When we solve a quantum-mechanical problem, what we really do is divide the 
universe into two parts—the system in which we are interested and the rest of the 
universe. We then usually act as if the system in which we are interested com- 
prised the entire universe. To motivate the use of density matrices, let us see 
what happens when we include the part of the universe outside the system. 

Let x describe the coordinates of the system, and let y describe the rest of 
the universe. Let g(x) be a complete set of wave functions. The most general 
wave function can be written 


W(x, y) = py Ci y)ex). (2.1) 


At this point we will convert to Dirac notation. 
Let |g;> be a complete set of vectors in the vector space describing the 
system, and let |0;) be a complete set for the rest of the universe. 


p(x) = <xlg:) and —(y) = ¢y18)). 
The most general wave function can be written 
> = > Cijli>10;> (2.2) 
W(x, y) = Cyl<xly> = 2 Cixlor<v18;> 
We can obtain Eq. (2.1) by taking 
Cy) = py Cij< y18;). 
Now let A be an operator that acts only on the system; that is to say, A does 


not act on the |@,>. When A acts on product states (for example, |y)) we really 
mean Ala)|b) = (Ala))|b). In such a case A does not equal 


Y Avled<oel, 
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but equals 
> Ail g>|9><O |< Ge- 
Then 
<A> = HIAly> = CF Cr5X 9191 Alpi >19;> 
ij’ 
= » CHC, <A AlQ,> 
= > <glAlor> pi (2.3) 
where 
pir = ¥ CKC,; = density matrix. (2.4) 
ij 


We define the operator p to be such that p;; = (g;|p|g;). p operates only on 
the system described by x. 


CWIAlW> = Yi <@ilA » ler><Gr lle 
= % <eilAplo:> = Tr pA (2.5) 
Where we have used the result 
Y l9:><grl = 1 (by completeness arguments). 


From Eq. (2.4), it is obvious that p is hermitian. Therefore it can be diagonalized 
with a complete orthonormal set of eigenvectors |i) and real eigenvalues w,, 


p = Ywilid<il. (2.6) 


t 


If we let A be 1, we obtain 


yw; = Trp = <A = Cl) = 1. (2.7) 


If we let A be |i’><i’| we have 
wy = Tr pA = (A> = GHIA) = YE (KW IODKO I) 


= VIKHMODWV?. (2.8) 


Therefore, 


i 


20 and Yw=1. (2.9) 


We now consider the concept of a density matrix independent of the pre- 
ceding motivation. First let us reformulate quantum mechanics: 
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Any system is described by a density matrix p, where p is of the form 

¥; wili><i| and 

a) the set |i) is a complete orthonormal set of vectors. 

b) w; = 0. 

ce) Liw; = 1. 

d) Given an operator A, the expectation of A is given by 

(A) = Tr pA. 

Notice that 


(A) = Tr pA = Yi Ci'lpAli’> = YY wki'li<il Ali’) 


vi 


= wil Aliy. (2.10) 


Since (i{A|i) = the expectation value of A in the state |i), it is obvious from 
(b), (c) and Eq. (2.10) that we can interpret the w; as the probability that the 
system is in state 7. If all but one of the w, are zero, we say that the system is in a 
pure state; otherwise it is in a mixed state. It is easy to show that a necessary and 
sufficient condition for a pure state is p = p. If a system is in a pure state, 
lipure? WE Can express the matrix as 


p= [fpara?<tpurels 
Pi = <gilele;> an €@ilipure> Cépurel Pj) 


(@ilipure (CP jlipure>)™ (2.11) 
More generally, 


Pi = » WX Gilk><y,|k>*. (2.11a) 


k 


If it is possible to discuss the system in the x-representation, we can write 


pla’, x)" Cx'lplx> = Y wix'lid<ilx) 
= by w,i(x’)i*(x), (2.12) 
which, for a pure state |i), becomes 
p(x’, x) = i(x’)i* (x) (2.13) 


In the x-representation we write 


<A> = Tr pA = [aecxparn, 


42 Density matrices 


But 


GxlpAlx) = <xlp ({ ax'be <x!) Alx) 


a | aGotebx>@r4be) = | eet x')A(x’, x). 


<A> = fe dx’ p(x, x')A(x’, x). (2.14) 


Considering again the problem of a system plus the rest of the universe, we can 
easily show 


<A> = [ve y)A(x’, x)W(x, y) dx dx’ dy. 
We see that in this case 


ple, x) = [vee WAC, ») dy. (2.15) 


From our example in which we split the universe into two parts, we see that 
pure states are not general enough to describe a quantum mechanical system that 
does not include the whole universe. It is unknown whether or not the universe 
is in a pure state. To reformulate quantum mechanics in terms of the more 
general density matrices, it would be appropriate next to find the equation of 
motion of p. But first, as a simple example of a density matrix, let us try to 
describe polarized and unpolarized light. 

Consider a beam of light travelling in the z-direction. First define 


(0) Wave function for the x-polarized state 
(2.16) 


(7): Wave function for the y-polarized state 


Any pure state can be written as a linear combination of the two states in Eq. 


(2.16): 
(B)=4(0) + 8()- (2.17) 
b 0 1 


jal? + |b/? = 1. 
When we use Eq. (2.11), the density matrix for the pure state Eq. (2.17) becomes 


e= (7 ne (2.18) 


where 


ba* bb* 
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In using Eq. (2.11), we took 


lo) = (,) and lg2> = (}): 


Now to examine four pure states and the density matrices corresponding to 


them: 
The x-polarized state: Leta = 1 andb = Oin Eq. (2.17). Equation (2.18) gives 


1 0 
= : 2.19 
poo. = (5) 2.19) 
The y-polarized state: Let a = 0 and bd = 1. 
0 0 
pss fs i); (2.20) 
The 45°-polarized state: a = 1 2and b = 1 IW 2 gives 
P45°-pol. = (} :) : (2.21) 
The 135°-polarized state: a = — 1// 2 and b = 1// 2 gives 
P135°-pol. = & “): (2.22) 


The p’s in the last four equations are for pure states. Now consider the following 
two mixed states: 


1. Mixture of 50% x-polarized and 50%, y-polarized states: From Eq. (2.11a) 
the density matrix for this mixture is 


0 
p= 4 x-pol. 4Py.pot. = t ). (2.23) 
04 
2. Mixture of 50%, 45°-polarized and 50% 135°-polarized states: The density 
matrix for this mixture is 


0 
P = 4Pas-por. + FP135-po1. = t ): (2.24) 
Thus these two mixtures have the same density matrix and show the same phys- 
ical effect. Note that a given pure state (for example, x-polarization) determines 
@ State vector (wave function) only up to a phase factor, whereas the density 
Matrix is determined uniquely. 
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2.2 ADDITIONAL PROPERTIES OF THE DENSITY MATRIX 


Recall that the density operator can be written 

p = y wilid<il, (2.25) 
where the system is in state (i) with probability w;. As time changes, the possible 
States of the system also change; so 


p(t) = ¥ wili(t)><i(o)]. (2.26) 
It is easy to find how |i(t)) changes in time, for we can expand |i(0)) in eigen- 


states of the Hamiltonian, H, and we know how those eigenstates change in time. 
Let HIE,) = E,lE,)- 


li) = ¥ 1E,><E,1i(0)>. (2.27) 
It follows that 
li(t)> = ¥ Ee" "CE, |i(0)>. 


For convenience we use units in which # = 1. If we define {(#) for any function 
f by the equation f(H)|E,) = f(E,)|E,)>, we can write 


li()> = Ye ME,><E,|i(0)> = e™|i(0)). (2.28) 


Notice that, according to our definition of f(A), if f(E,) can be expanded in a 
power series for any eigenvalue of H, then (H) can be expanded in the same 
power series. 

Substituting Eq. (2.28) into Eq. (2.26), we get 


p(t) = Y we "i(0)><iO)|e™ = eM p(O)e'™. (2.29) 


Taking the derivative of p with respect to time, we get 
p = — i(Ap — pH). (2.30) 


You may be suspicious about treating H like a number when taking the 
derivative of terms such as e~‘”', but such a procedure can be justified by 
expanding e~'“" in a power series. Alternatively, we can obtain Eq. (2.30) from 
Eq. (2.29) by taking the derivative of (E,|p(t)|E,,>. In any case, Eq. (2.30) is 
true, and it plays the same role for density matrices that Schrédinger’s equation 
plays for wave functions. It should be noticed that an observable A in the 
Heisenberg representation obeys 


A = +i(HA — AH). (2.31) 


22 Additional properties of the density matrix 45 


The signs of Eq. (2.30) and Eq. (2.31) are opposite. We could put Planck’s 
constant back into the equations by replacing every H by H/h. From Eq. (2.29) 
we obtain 
Tr p'(t) = Tr [e“™"(p"Oe™)] = Tr [(p"Oel™e“] 
= Tr p"(0) (2.32) 
where we use the fact that Tr AB = Tr BA. It follows that Tr p” is constant in 


time, and consequently Tr f(p) for any function fis constant. This result can 
also be obtained by noticing that 


p” = Y wild<il, 


fp) = YL fOwplir<il. (2.33) 


so 


Tr f(e) = & fw), and the w;’s are constant. [being the eigenvalues of p and 
thus unchanged by a unitary transformation such as Eq. (2.29)]. 
It is possible to define “entropy” by the equation 


“Ss? = — Dw, In w,. (2.34) 


A pure state has ‘“S” = 0, whereas very impure states have large positive 
“entropy.” The quotation marks are used because, by our definition, the ‘“‘en- 
tropy”’ cannot increase in time but must remain constant. 

Next let us try to get a bit more experience in dealing with density matrices. 
In ordinary quantum mechanics it was easy to find such things as the expectation 
value of the position, the expectation value of momentum, and the probability 
that a system is in a given state. 


(a) Expectation value of position: According to Eq. (2.14), if A is an operator, 
<A> = fe dx’ p(x, x’) A(x’, x). 


In this case A = x, and 


A(x', x) = (x'|A|x) = xXx'|x> = x d(x’ — x), (2.35) 
so 


{x = je Xp(x, x). (2.36) 


(b) Expectation value of momentum: Now we must take A = p. Since 


He aly, 
x 


«xi ply) = ia 
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we have 
; RaW 74 
A(x’, x) = — — d(x’ — x). (2.37) 
i 0x 


Integrate by parts; for any function f we have 


[reo 5. 86" ~ 29 as - | 80" - 9 Fe) dr = FO) 
Ox Ox 


<p> 


~[ as dx’ a d(x’ ~ x) i p(x, x’) 
i Ox 


_fh fax E p(x, | (2.38) 
i ox Aa 


(c) Probability that a system is in state |y): Finding the probability that a 
system is found in the state |y) is equivalent to finding the expectation value 
of the operator |y)<z|. To see this for the case of a pure state, notice that if 
the system is in a state |y), the probability of experimentally finding it in 
state |x) is 


KxW>l? = ld) = Widd<adIW. 


Therefore, the probability that the system is found in state |v) can be written 


P = probability = Tr ply><x| = Tr ¥ wili><ild><xl 


= Ywildilol? = <xlelo. (2.39) 


The equation 
P = Y will? 
is in accord with our interpretation of the w; as the probability that the 
system actually is in state |i). 
The probability density at value x. for a coordinate is the probability 
that the system is in state |x), which by Eq. (2.39) is p(xo, Xo). For a pure 
state |W), p(X, Xo) = W(x)”. 


2.3 DENSITY MATRIX IN STATISTICAL MECHANICS 


When |9;) is an eigenket (eigenfunction), and E; is the corresponding eigenvalue 
of the Hamiltonian A of the system, the probability that the system is in the 
state |p; is 

(1/O)e7 FF, (2.40) 
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Thus the density matrix is 


P= Ywil@.><Pal, Where Wy, = ; ete. (2.41) 
Alternatively, the coordinate representation is 
, 1 — BE; #7 
Ax, x’) = > Q eA); (*'). (2.42) 
Because H|:,) = E,|p,), we can write Eq. (2.41) as 
1 - pH e FH 
Ba velo KPal = ae (2.43) 
where 
e FF = g = Ye Fh = Tr eM, (2.44) 
so 
ef 
p= Tretia’ (2.45) 


The average energy of the system, U, can be written 
Tr [He *"] 
Tr [e-F*] © 


When we know the free energy and the energy of the system, the entropy is 
derived from 


U = Tr pH = (2.46) 


F=U-TS. (2.47) 


It can be shown that, if Hp is any other Hamiltonian, 


F < (A) — TS, (2.48) 
where 
_ Tr [He Pre] 
(Ho = Tr [e~ AH} 2 (2.49) 


and S, is the entropy calculated from Hp. Fin Eq. (2.48) is the true free energy 
Corresponding to the true Hamiltonian H, as written in Eq. (2.44). When we 
Write the free energy corresponding to Hy as 


Fo = (Hoo — TSo (2.50) 
and subtract Eq. (2.50) from Eq. (2.48), we may write 


,7r [(H_- Hye P] 


F<Fyo Tr [e7 PH] 


(2.51) 
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This theorem will be proved later, in Section 2.11. 
Now, we regard the density matrix as a function of B: 


p(B) = e PM/Tr 7 BH. (2.42) 
The unnormalized p is defined by 
py(B) = e PH, (2.52) 


In place of py we will hereafter write p. Next we will show that p(f) obeys the 
differential equation 


—6p/0B = Hp. (2.53) 
The proof is as follows. In the energy representation, Eq. (2.52) can be written as 
px Oye F™. (2.54) 

Differentiating this equation, we have 
— 09,08 = 5;,Ee °" = E,pi;, (2.55) 


from which we obtain Eq. (2.53). 
The initial condition for Eq. (2.53) is 


p(0) = 1. (2.56) 
We can write Eq. (2.53) in the position representation as follows: 
— Op(xx'; B)/OB = H,p(xx’; B). (2.57) 


Here the subscript x on H,, indicates that H,, operates on x in p(xx'; B). The 
initial condition is 
p(xx’; 0) = d(x — x’). (2.58) 


2.4 DENSITY MATRIX FOR A ONE-DIMENSIONAL FREE PARTICLE 


We want to solve the differential equation discussed in Section 2.3 for a one- 
dimensional free particle. The Hamiltonian is 


H = p’/2m, (2.59) 
and the differential equation (from Eq. (3.57)) is 


=, , _ fp2 2 
Se = ‘ o(x, x’, B). (2.60) 
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This is a diffusion-type equation, and we can write down the solution readily: 


p(x, x"; B) = | ae zo | -(sara) (x - vl, (2.61)* 


The numerical factor is chosen such that 


p(xx’; 0) = d(x — x’). (2.62) 
For a linear system of length L, the trace of Eq. (2.61) leads to 
kT 
ePF = | p(x, x) dx = /@ 2.63 
{ px, x) mar (2.63) 


This is the partition function for the system. 


Problem: Show that the partition function for a three-dimensional system with 
N particles is 


en PF = VM mkT/2nh?)>. (2.64) 
Another way of deriving Eq. (2.61) is as follows: 
p(x, x’; B) = Ye Pm, (x) ’).- (2.65) 


If the particle is in a large box of volume V, we take 


_, [ Yaep , 2 Bt eh 
~ (2nh)? and W(x) > Yp(x) = Tes 


Then Eq. (3.65) becomes 


Che ee ane 
X, x’; = e p (/2moipih (x-x’) 
pC B) Cnty 


= (; mae eta (2.66) 
v4 


as in Eq. (2.61). 


25 LINEAR HARMONIC OSCILLATOR 
The Hamiltonian for a linear harmonic oscillator is 


H = p?/2m + mw’x?/2, (2.67) 


a S if Fs a se: 
; Note that for x,x’ constant, no expansion in powers of £ exists near 8 = 0. This is, 


Indeed, an important feature of p,,.. and will influence the construction of perturbation 
Stries, 
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so that the differential equation for p is 


—6p  —h? @ mo”, 


= + : 2.68 
ap 2maxr?” 2 *? eee) 
Let us write 
mo hw ho 
i) fy. . pe PO ae 2.69 
g roe z 3 B LT (2.69) 
which simplifies Eq. (2.68) to 
— op —6*p 2 
—~ + . 2.70 
if 38 &*p (2.70) 
The initial condition is 
p = (x — x’) for f= 0, (2.71) 
or 
p= | oe E-&) for f=0. (2.72) 


The factor comes from the scale change in the 6-function. In general, 6(x — x9) = 
If’ IL L(x) — £(%o)] for f monotonic. 

For low f (high temperature) the particle should act almost like a free 
particle, as its probable kinetic energy is so high. Therefore we expect that, for 
low f, the density matrix for a harmonic oscillator will be given approximately by 


Ag, &.f) = Vmer|(antifye S814, (2.73)* 
Knowing the Gaussian dependence of pz,.¢ on €, we try the following form: 
p = exp {—[a(f)d? + B(A)E + c(f)]}. (2.74) 
Use of Eq. (2.74) in Eq. (2.70) leads us to the equation 

dé? 4+ BE + ci = (1 — 4a7)é? — 4abé + 2a — B?, (2.75) 

where a prime (’) means a derivative with respect tof From Eq. (2.75), 
a’ = 1 — 4a’, (2.76a) 
b' = —4ab, (2.76b) 
c = 2a — B’. (2.76c) 


* This is only true for € # ¢’. When they are equal, p(é, ¢’, f) is equal to p,,., only 
for f = 0. See Section 3.10 for details. Obviously, when T is large and the classical 
motion has large amplitude, the larger the amplitude the less will the oscillator feel 4 
force between fixed x and x’. 
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Equation (2.76a) is integrated to give 
a = (3) coth 2(f — fo), 


where the initial condition Eq. (2.73) requires that the integration constant fy 
vanish: 
a = (3) coth 27 (2.77) 


Equations (2.76b and c) are integrated as 
b = A/sinh 2f. (2.78) 
c = (4) In (sinh 2f) + (A?/2) coth 2f — In B, 


where A and B are constants. Use Eqs. (2.77) and (2.78) to get 


exp |-(5 coth 2f + Ag + A’ coth ~)]. (2.79) 


p= : 
Vsinh 2f sinh 2f 2 
When we let f > 0, Eq. (2.79) gives 
2 2 
pases |e). (2.80) 
Jf 4f 


For this result to agree with Eq. (2.73) we see that 


mo 
A=-€, B= |—. 2.81 
s a (2.81) 


Using Eq. (2.81) in Eq. (2.79), we obtain the final form: 


’ mo 
ep) = Ie 
lx, x'5 8) = sinh 2f 


xX exp iran & + x’) cosh 2f — 2x’ |} (2.82) 


or 


Ry = soa EE ee 
P(x, x’; p) = _— sinh (ha/kT) 


exp eens ees (x? + x?) cone — 2xx' |S. (2.83) 
2h sinh (ha/kT) kT 


The special case when x = x’ gives 


" m@ —m@ > 
/x; B) = : tanh f ). 2.84 
Oa? pe Go oe er f) oe) 
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This is the probability for finding the system at x. Notice that it is of Gaussian 
form. 
Equation (2.84) is used to calculate 


coth f = —— coth 
J (x, x) dx ~ 2mw f 2m 2kT 


(ey fx?o(x,x)dx ht h ho (2.85) 


The average of the potential energy is thus 
Potential energy) = (4)mw?<x”) = (hw/4) coth f. (2.86) 


We may compare this result with the average of total energy calculated by means 
of the partition function. We know 


ho foe"  fiatl+eF ta 
ee ere a ee 


Equation (2.86) is exactly half of Eq. (2.87). Therefore we know that 
(Kinetic energy) = (Potential energy) = (hw/4) coth f- (2.88) 
The partition function is derived from Eq. (2.84): 


eFF — | p(x, x) dx = | dea 1A ee 
2nh sinh 2f\V mw tanh f 2 sinhf 


coth f. (2.87) 


which leads to 
= 5m (2 sinh f) = kT In (ef@/2#T — g-fo/2kT) 


= me + kT In(t — e7 MT), (2.90) 


This is the free energy, as already derived. 
We may examine the limit of high temperature, or small fin Eq. (2.84). In 


this limit 
—ma?*x?/2 ~V(x) 
eS e ———_—— |= ———— ], 2.91 
p(x, x; B) > x0 ( a ) exp ( 7 ) (2.91) 


except for the factor in front of the exponential. This result, Eq. (2.91), agrees 
with classical mechanics. 
The low-temperature limit is found by inspection of Eq. (2.83): 


—< 2 = 2512 
rex) [ew (Sie) (—ap)er(=—"g)> es 


This limit is calculated as 
p(x, x’) = Ye Fa (x)ph(x’) > ep o(x)p5(x'), (2.93) 
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because when f —> ©0, only the ground state has an effective contribution. 
Equation (2.92) agrees with Eq. (2.93) because we know that for the ground state 


1/4 
(x) = (*) el me] 2h) x? (2.94) 


2.6 ANHARMONIC OSCILLATOR 


Consider the potential energy shown by the curve in Fig. 2.1, given by the follow- 
ing function: 


2 
V(x) = — x? + kx’, (2.95) 


We will consider the area near x = 0 only, so that the region V(x) < 0 does not 
come into the calculation. 

Because of the anharmonicity, when the temperature increases the mean 
position of the oscillation moves out, as shown in Fig. 2.2. Bearing this in mind, 


Fig. 2.2 Mean position of the oscillation is displaced. 
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we will treat the problem using the minimum principle discussed previously. 
The principle is written as 


F < Fo + CH ae Hoyo. (2.96) 


Here F is the true free energy and H is the true Hamiltonian. When Hy is any 
Hamiltonian, (>) means the average taken in a system characterized by Hy. Fo 
is the free energy of the system of Ho. 

In the present problem we take for Hy 


Hy = p?/2m + (ma?/2)(x ~ a)’, (2.97) 


which describes a harmonic oscillator with its center displaced by an amount a. 
The @ in Eq. (2.97) is the same as that in Eq. (2.95). From Eq. (2.90) we find that 


Fy = kT In (2 sinh (Aw/2kT)). (2.98) 
Now we have only to find 
<H — Hoo = (™ x? + kx? — . ive - a) ; (2.99) 
0 


If we make the substitution y = x — a, Eq. (2.99) becomes 


2 
<H — Hoyo = (moray + a a? + ky? + 3ky’a + 3kya? + ka") ¢ 
") 


(2.100) 


Because Hy = p?/2m + (mw?/2)y?, we can easily calculate the expectation 
values of the various powers of y. By symmetry, 


{V0 = (¥*)o = 0. (2.101) 
From Eq. (2.85), we obtain 
<y?%o = (h/2ma) coth (hw/2kT). (2.102) 
Substituting Eq. (2.102) in Eq. (2.100) yields 
F < Fy + (H — Hy)o = Fo + (ma’/2)a? + ka? + 3ka (y*5o. (2.103) 
The best estimate of F from Eq. (2.103) is obtained by minimizing the right- 
hand side, In so doing we neglect the term ka* because we are interested only in 


small oscillations, so that V(x) < 0 does not occur. Then the differentiation of 
Eg. (2.103) with respect to a leads to 


moa + 3k<y*>_ = 0, (2.104) 
that is, 


~—3k _3kh coth (Heo[2KT) 


ae Ene = aaa (2.105) 
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In this case, Eq. (2.103) becomes 
F < Fy — (mw’/2)a’. (2.106) 


In Eq. (2.105), the displacement of the mean position, a, has a simple interpreta- 
tion. It is the point at which the average force on the oscillator vanishes. This 
result is seen as follows: from Eq. (2.95), 


Force = mw?x + 3kx? = mw?(y + a) + 3ky? +++- (2.107) 
(Force) = mwa + 3k{y*> +°°°. (2.108) 
Therefore, when we neglect the higher powers of a, we arrive at Eq. (2.104). 


Problem: Consider a general bounding potential V(x) like that shown in Fig. 
2.3. The Hamiltonian is 


H = p?/2m + V(x). (2.109) 


In order to apply the variation principle in Eq. (2.96), try the following Ho: 


2 2 
Hy = PE 4 Moo (x — ay?, (2.110) 
2mMo 2 


and vary a, mo, and wp». Show that 


i) my = m; (2.111) 
ii) Mp and a are determined from 
(V'"(x))o = 0, (2.112) 
(xV'"(x))0 = (p?/m)o. (2.113) 
Vix) 


| a 


Fig. 2.3. A general bounding potential. 
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Here <), is taken for Hy. Equation (2.112) comes from the consideration of the 
absolute location of the system, and Eq. (2.113) comes from the scale of this 
variable. The latter corresponds to the virial theorem, which states that 


(xV'(x)) = <p?/m). (2.114) 


To show how Eq. (2.114) is related to considerations of the scale of the 
variable, let us derive the virial theorem in a more general form. 

Consider a system inside a cubical box with sides of length L. The Hamil- 
tonian is taken to be 


2 
H,= DP + ¥ voy. (2.115) 
7 2m; yj 
i<j 
(We assume that the motion of the system can be described by a potential 
depending only on the distances separating pairs of the particles making up the 
system.) rj; = |r; — vil. 
If the system is bound, then for a sufficiently large box the pressure is 
essentially zero. But in general, 
eS eee (2.116) 
6 (Vol) 3° OL 
Let F, = F for a box of length ZL. When we later prove the theorem that 
F < Fy + (H — Ho)o, we will in fact prove that (H — Ho), is the first-order 
correction in the perturbation expansion for F. We get an expression for P by 
writing 


P = Pressure = — 


Fig+e © Fy + correction term 
or 

Frases © Fy + (Aratey — AD a,- (2.117) 
Now Hy: +.) differs from H, only in that (r;), can run from 0 to L(1_ + ¢) rather 


than 0 to L, and similarly for (r;), and (r;),. Therefore let us make a change of 
variables, such that ri = r,/(1 + ¢). As p? = —h?V?, we obtain 


p? 
Hya+e) = » Im “(1 i” a2 + 4 v(d te e)ri;] 


m; 


me yr Be +> bee é 2p Pe +E rv) | (2.118) 
72m; 0 ij. 72m ij. 
i<j i<j 


Since the new coordinates (primed) run over the same range as the original 
(unprimed) ones in Eq. (2.115), we can eliminate the primes over the r;; and P; 
and write 


CHra+e — Aida, = ‘| -2 <kinetic energy> + (z rive). (2.119) 
ij 


i<j 
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From Eq. (2.116) and Eq. (2.117), 


spyeese St spl el Gee: Gi) 
OL Le E 
From Eg. (2.119), 
3PV = 2 (KE) — (y rv) (2.121) 
Ue 
i<j 


For the case of a harmonic oscillator, P is zero and we get 


{xV'(x)) = <p?/m). (2.122) 


This result should be compared with Eq. (2.113). 
For the case of a particle moving in an inverse-square force field, we get the 
classical result 


(KE) = —(})<potential energy). (2.123) 
This method can be generalized. Let H be the Hamiltonian for the original 
system and let H, be the Hamiltonian with the box changed in an infinitesimal 


manner. Suppose Fy = Fy, and suppose that H, — H = eS. Then the above 
method gives (S),, = 0. For example, in Eq. (2.118) we have the case for which 


2 
Pi A 
S = | -23 3 + » nV eo]. 
i<j 


As another example, suppose that the box undergoes an infinitesimal transla- 
tion, ¢, in the x direction. We get, by taking x’ = x — e, 


pi? av 
H,® YS + Yi) +e Y — i), (2.124) 
72m, ij, Ox 
i<j i<j 
So that 
s=y Gq) (2.125) 
ij Ox 
i<j 
and 
(¥ ud ) = 0. (2.126) 
ij OX 


Compare Eq. (2.126) with Eq. (2.112). 
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2.7 WIGNER’S FUNCTION 


The density matrix may be written in the coordinate representation and in the 
momentum representation as follows: 


p(x, x’) = Yeo (x)g7(x’), (2.127) 
p(P, P’) = Le ol voip’) 
a [ oc: x jel NO APD dye dx’, (2.128) 
The diagonal element 
p(x, x) = P(x) (2.129) 


represents the probability for finding the particle at x (neglecting the normaliza- 
tion constant 1/Tr p). Similarly 


p(P, p) = P(p) (2.130) 


is proportional to the probability for finding the particle at p in momentum 
space. 

The expression Eq. (2.130) is used in calculating the average value of the 
kinetic energy: 


(fF) _- Te L(p*/2m)p} _ J o(p, pXp?/2m\(dp/2nh) —— 4 131) 
2m Tr [p] § p(p, pXdp/2nh) 


In classical mechanics, the density function f(p, x) in phase space has the 
following property: 


P(p) = | IG te 

(2.132) 
a ap 

P(x) = | fp. x) AE. 


We ask ourselves if there is any furfttion f(p, x) in quantum mechanics that 
satisfies Eq. (2.132). One answer is Wigner’s function f,(p, x): 


fw(P, x) = fo + . x— ") eo diy, (2.133) 


This function is derived by regarding p(x, x’) as a function of (x + x’)/2 and 
(x — x’)/2, and then writing (x + x’)/2 as x and making a Fourier transform 
with respect to (x — x’)/2. 
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Let us check that Eq. (2.133) satisfies Eq. (2.132). First, 


ap _ Ne ce oh —ipnjny AP 
[Jor ”) sah f(x ee fe = 
= | A: + a re ) &(n) dn = p(x, x) = P(x). — (2.134) 


Thus the second part of Eq. (2.132) is satisfied. Next, using Eq. (2.128) and 
Eq. (2.130), we have 


P@) = soy = [ cs xe“ DPE—*) ie dx! 


fol» + ss yr ") eo ipalh dy dyn = [ce y) dy, (2.135) 


where we have made the change of variables: 


x= yt n/2 
(2.136) 
x= y — 4/2. 
From Eq. (2.132) we see that, if A(p, x) is either a function only of p or a function 
only of x, then 


Gann | Jnr, np, ) Pd, (2.137) 
nh 


However, Eq. (2.137) is not true for a general function, A(p, x). 

Although Wigner’s function fy(p, x) satisfies Eq. (2.132), it cannot be 
regarded as the probability for finding the particle at the point x and with the 
momentum p, because fy(p, x) can become negative for some values of p and x. 


Problem: Calculate Wigner’s function for a harmonic oscillator. This is a 
Special case where fy(p, x) is always positive. 

In Eq. (2.133), we derived Wigner’s function from the density matrix. We 
can derive the latter from the former: 


fu p,~— eftoimyx—x) AP _ p(x, x’). (2.138) 
2 2nh 


Thus f,,(p, x) and p(x, x’) contain the same information. 


Problem: Show that the momentum representation of the free-particle density 
Matrix is 


pC, Pp’) = 2nhd(p — p'je PART: (2.139) 
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2.8 SYMMETRIZED DENSITY MATRIX FOR N PARTICLES 


We will now discuss the density matrix for a system of many identical Bose or 
Fermi particles. 


The symbols will mean 
D Distinguishable 
S Symmetric 
A Antisymmetric 


First consider the case of free particles. The Hamiltonian is 
H=----YVj. (2.140) 


The density matrix pp satisfies the equation 
Opplop = — Hp». (2.141) 


The solution is 


3N/2 
m m 
Pe ee eee -( ) exp ene a 


2nh*p ~ 2A BS 
(2.142) 
In general when particles are interacting with each other we can write 
Pd(X1s X20 +++ Xw3 X15 X95 +++ XN5 B) = et "1 XW WFO] + Xy)- 
(2.143) 


When the particles obey Bose statistics, then 


Ps(X1--- 5X, --. xy; B) = > FFA (x, + x(x, xy). (2.144) 


Symmetric 
states only 


We introduce the notation P to indicate the permutation of particles. For 
example, for a permutation 


12345 
Proll dt Lt 
$4312 
we have 
Px, — Xs. 
When the state w, is symmetric, 
W (Px) = Wir). 
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We can construct pg starting from pp. As an example consider the case of two 
particles: 


Po(X1X2; X4X2) = x, eT FFay (x, X_)Wi (4X4). 
all states 


Now interchange x{ and x}: 


Pp(X 1X25 X4X4) = ea eo FEW (x, x2) W#(X)x'). 
all states 


Take the average of the two: 
Bpp(%1X25 X4X2) + ppl%1X25 X2%4)] 


= Ye Pp (xyx)3[WF (x2) + WFO9%)] 


all states 


De Pb (x2) WF (x4 x4), 
symmetric 
states only 


because 


0 antisymmetric WF 
wi(x4x5) symmetric WF. 


AMF (xix2) + WFO2x1)] = } 
In general, a group-theoretical argument can show that 


i » W(Px, + °° Px,+** Pxy) = W(X4 6 Xe Xy) symmetric 
N!P 0 for any other symmetry. 


(2.145) 


Here 5°, means that we sum over all permutations. When we use Eq. (2.145), 
the case of N particles can be written as 


PSX Xe XN XL XR Xy) 
1 te , , 
= yy Polen eas Pays Px Pay). (2.146) 
Thus the partition function becomes 


~pF : 
e tem [ater omg ian en) dey ody 


1 
yi | Polen sees Peco age) dy dip (2.147) 
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Problem: Derive the partition function for a noninteracting Bose gas. Use 
Eq. (2.142) in Eq. (2.147) and first write 


eG _m_\pNr2 
N!\2nh?p 


x a op |, [(x, — Px)? +-++ + Oy — Px)? +°°- 
+ (Xv — Pas) I} dx,+++dxy. (2.148) 


In taking )p, we start with the identity permutation that gives V" from the integ- 
ral and next go to a pairwise interchange of particles, three-particle cyclic 
permutation, and so on. The combinatorial factors must be taken into account. 


Solution: Any permutation can be broken into cycles. For example, 


123 4 5 6 1 5 6\/2 3\/4 
r(i a i) = esoane = (1 { if i(:) 
S26. 5 6 I/\3 2/\4 


Suppose a permutation contains C, cycles of length v, where 3}, vC, = N. For 
vy > 1, let 


m 3v/2 
h, = a EDs dx, dx, ee dx, 
2nh*B 


x €xp ap [1 — x2)? + (x, — x3 toe: 


iat a) eo, “)'I} (2.149) 


3/2 3/2 
hy =(—" dx, =V(—=-) . 
2nh?B 2nh?B 


Then Eq. (2.148) can be written 


ema ty (1 ye). (2.150) 


1 
N! v 


It is not difficult to show that 


|e exp [—a(x — y)*] exp [—b(y ~ z)7] 


pa ae te Dn Na 
= 6 ri :) xp | ma ae z) |. (2.151) 
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Using Eq. (2.151) we obtain 
m 


3/2 
h, =V[(— ; 2.152 
Cae ( ) 


We must now count the possible permutations. Let M(C,,...C,) = the 
number of permutations that have 


C, cycles of length | 
C, cycles of length 2 


C, cycles of length q 


Then 
e FFs — al y, M(C,,...,C,) [] (A). (2.153) 
N! c1,€2,-.., Ca y 
As a preliminary to the determination of M(C,,..., C,), consider the case of 


N= 6, C, = 3, C3 = 1, Ch = Cy, = Cs = Co = 0. An example is the 
permutation P = (5)(3)(2)(461). Notice that the sequence Q = 5, 3, 2, 4, 6, | 
corresponds to P. In other words, from the sequence Q, we can uniquely con- 
struct the permutation P with C, = 3, C; = 1. But from Q’ = 2,5, 3,4, 6,1, 
or from Q” = 5,3,2,1,4,6, we can also construct in the same way the same P. 
In fact, it can be easily seen that there are 3! times 3 = 18 different sequences 
that can be used in the way described above to construct the permutation 
P = (5)(3)(2)(461). 

Generalizing to any set of C’s, we see that there are two ways we can get a 
new Q that leads to the same P: 


a) we can interchange cycles of the same length, 
b) we can make cyclic permutation within a given cycle. 
There are [], C,! ways of interchanging cycles of the same length, and [], v™ 


cyclic permutations within cycles. Thus for each P there are [], C,! TT, vw 
different Q’s that lead to the same P. Because there are N! possible Q’s, 


N! 
M(G,...,C,) = eeane: 
It follows that 
hc 
eMs=a yy. TI =2—; (2.154) 


€1,C2,...€q v C,! vo 


where >, vC, = N. 
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Instead of trying to do the sum in Eq. (2.154), we will find the free energy 
of the grand canonical ensemble (that is, we will let N vary). 


eS Ye ee (2.155) 
N=1 


As usual, let « = e*#*, Then, 


e AF y Il ger = > ll [A,(a’/v) : 


Ci,.0-Cg v Cty Ccivege ty cC,! 


where now each C, runs from 0 to oo. Interchanging [] and >, we have 


er = TE, MODE = [Jose (1,2) = (4.9). 


v Cy=0 


BF 


li 


eo) hya” m 3/2 a” 
ste tea) ae! 2.150 


This result is identical to the one we obtained in a much simpler way earlier. 
When the density of the gas is not large, only the identity permutation is 
important for high temperatures or for the limit of A > 0, since we have such 
factors as e~ mkT/2#yx- x1 This means that the quantum effect appears only 
for low temperatures. 
For the antisymmetric case, 


+ Larter oe ace s 
Pay Xq0 0 XN5 XY Xk Xn) 


1 , , , 
= Nie ODP 93 Px s+ Pxg** xy), 


where 
aye 1 for even permutation 
—1 for odd permutation. 


Problem: Modify the argument leading to Eq. (2.156) so as to obtain the cor- 


responding result for a noninteracting Fermi gas. 


2.9 DENSITY SUBMATRIX 


For N particles, the density matrix can be written 


P(X1, X2,-- +5 yi Xi, X9,--- Xn) = 2; WiPAX1,.--5 Xy)OF(X),--..XN). (2.157) 


2.9 Density submatrix 65 


Notice that we are not now specializing to the case of statistical mechanics. To 
compute the expectation value of P?/2m, we proceed as follows: 


2 2 
Pave. Trp Pi 
2m Tr p 2m 


= Trp am a 


dx,°*:dxy 


' 
Xp FX 
2 


h P 
= —__ ax pene x ¥ x’, = Ir ese : 2.158 
2m [ax 2 : Ox? 5 Pl 1 *) o 2m ( ) 


where we define 


x1=Xi 


S px, Xa5e++9 X35 X's X95 +004 Xy) dX2°** dXy 
Tr [p] 


fp; is the one-particle density submatrix. Using this matrix we can calculate, for 
example, 


Pk h? 0° 
(z Pe) = —-N (|< ax”? pil, | : dx = NTr Ee as | (2.160) 


(assuming that the particles ate essentially identical). 


p(x, x’) = (2.159) 


Similarly, 
(z vox) = N [ Yeon x) dx = N Tr [Vp,]. (2.161) 
k 


The two-particle density submatrix is defined as 


f paleika%s ys x XoXs Sy) axa? 2 xy 
Tr [p] 
From Eq. (2.162), the two-particle potential energy V(x,x,) has expectation 


(y Vix =)) = ee OE? [ Vowxaeate X93 X4X2) dx, dx, (2.163) 


i<j 


(2.162) 


P2lX1X23 X1%2) = 


Special cases of density submatrices are the distribution functions. The single- 
Particle distribution function is defined as 


Ps(x) = p1(x; x). (2.164) 
The two-particle distribution function is 


PAX, Y) = polx, V5 x, y). (2.165) 


- Ina uniform substance, such as a gas or a liquid, p,(x) is independent of x, and 
P2(x, y) depends only on the difference (x — y). 
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2.10 PERTURBATION EXPANSION OF THE DENSITY MATRIX 
Recall that the density matrix in statistical mechanics satisfies 


OP. a 


i Hp. (2.166) 


There are very few Hamiltonians for which we can solve Eq. (2.166) exactly, but 
it may be that H is close to one such Hamiltonian, Ho. 


H = Hy + Hy, (2.167) 
aay re (2.168) 
op 


We would like to use py to obtain an approximation to p, which we expect 
to be close to py = e~ #9, Because p is close to e~ °°, we expect e”°*p to vary 
slowly with f. 


AD = He! p + dn one eb Hip — eB Fp = — eb Hp, 
(2.169) 
Integrating Eq. (2.169) from zero to 8, and remembering that if 8B = 0, we find 


eFoBy = 1, 


i 


eb (fg) — 1 = — | * eHOP' HT (B') dp’. (2.170) 


Therefore, 
o(B) = po(B) — | * p(B — BH o(6') ap’. (2.171) 


The last term in Eq. (2.171) is small if H, is small, and it is a correction term to 
the approximate equation p ~ po. If the correction term is small, we can use an 
approximate p(f’) to obtain a much more accurate approximation to p(f). For 
example, if we make the approximation p(f’) ~ po(f’), we have 


BY xB) = | ” po(B — BVH pol") aB. (2.172) 
0 
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In using Eq. (2.172) as a new approximation to p(f’), we can find a still better 
approximation to p. Continuing in this manner we get 


8 
(NERDS | 4B'Tpo(B — B’)Hpo(B)] 


e) 


8 8’ 
+ | dp’ | dB"[oo(B — BH :po(8" — B"H s00(6")] 


B B B" 
- | ap’ | ap" | dp’, Jt: (2.173) 
0 0 0 
We can easily rewrite Eq. (2.173) in coordinate representation. For example, 
p(x, x’; B) = <xlp(B)Ix’> 


we polx, x's B) ot Cxlpo(B - p)( lx")x"ldx " H,polB1x") dp 
; (2.174) 
where we use 
* hx" x"| dx" = 1. 
If H; = V(x), then = 
Kx"| Ay pl BI’) = Vx"Spol(x", x's BY, 
and Eq. (2.174) becomes 


p(x, x’; B) = polx, x’, B) 
00 B 
- | [, pote 2158 — POV bo', 5) dB! da + 
-o JO 
(2.175) 
2.11 PROOF THAT F < Fy + (H — Ayyo 


Now that we have a perturbation theory, we are in a position to prove as 
promised in Section 2.3, that F < Fy + (H — Hoyo. Let V= H — Ap. 
Since F is determined by the equation e~ 8" = Tr e *#0*”) the most obvious 


approach is to find an approximation to F by using the perturbation expansion of 
e~AHO+V) — 9, 


B 
e7BHotV) _ g-BHo | eo B-wHoy puto dy 
i?) 


B fur 
+ | | du, du, e~ @-")HoVe-(1—m2)HoVe-wHo _... (2.176) 
0 i?) 
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Taking the trace of Eq. (2.176) we obtain 


& 
e BF = Tr e7 Plo +¥) nan e7bFo eels { Tr [e~ F- Foye wie} du 
Qo 


B fa 
+ | | du, du, Tr [e For Ro ye terme e~ sate] ome, 
0 JO 


(2.177) 
Using the fact that Tr AB = Tr BA, we find that 


8 
e7 FF = g AFo _ | du Tr e7 fHoy 
if) 


& fm 
+ | | du, du, Tr [e~PHog(1~us)Hoy g~(us—uaoy] s+. (2.178) 
Qo JO 


It is possible to simplify the second-order term. Let w = u, — uw, and let 
x = u,. Then the integral becomes 


| Tr [7 PHoevHope-wioy 7, (2.179) 
A 


where A is the shaded region in Fig. 2.4. Now let w’ = 8 — wand let x’ = B — x. 
Then the integral becomes 


{ Tr [7 Peete Ve- voy J. (2.180) 
i: 
Averaging Eqs. (2.179) and (2.180), we find, from Eq. (2.178), 
B 
eWFF =e AFo _ B Tr [eo PHoy] + | dw Tr [e~PHogwloye-wHoy | 4... 
0 


(2.181) 


Fig. 2.4 The shaded area A is the region over which Eq. (2.179) is integrated. 


Let |m) and |n) be eigenstates of Hy, and let E,, and E£, be their respective 
eigenvalues. Then 


Tr [eP¥9V] = ¥ <nle PV in> = Fe FF, (2.182) 
nt a 
Tr [e~P#oevFoyewHoy] = ¥ <nle~P#oe""oV ¥ |m><mle~ *#V [n> 
nn m 
= bx e BEngwEn— Emly Vs (2.183) 


where we have used the fact that >, |7)<m| = 1. Equation (2.181) becomes 


Em sca e BEn 


-B 
en FF me go PFo _ eo PEny EG e es Vogl? Eevee (2184 
xs py E,~E, | Van! (2.184) 
Note that if m = n, 
oom — eRe gat Em = 1 ete, 
E,, in E., E,, a: En 


In order to get an expansion for F, we write F = Fy + F, + F, + +++, where 
F, is a first-order perturbation and F, is a second-order perturbation. Then 


e@ PF — g7AFog~B(FitFat-* +) 


il 


2 
e FFocy — BF, — pF, + Pe, + F, ++)? +-++) 


ent E — pF, + & Ftp) + ce (2.185) 


up to and including the second order. We have a perturbation expansion for F 
if we set 
—e FoR F, = ->y peer, 
n 


and (2.186) 
2 ~BEm —BEn 
e-#o (8 2 — pr,) a bye 8 yy, 
(FF pra) = OE el 
Solving these equations we obtain 
Tr Ve~AHe 
Fy pas = Va (2.187) 


and 


2 >, e@” PEn 
B [= see al 


—BE, 2 —pEm = ~BEn 
Fy = ewe | me Hel? 


(2.188) 
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With the Cauchy-Schwarz inequality, 


Eat <] Dla] Ei], 


it is easy to show that for any set of positive real numbers, w,,, 


Lin Way . < Dn Wal al” 
ae Wr = ae Wr ; 
Applying this inequality to Eq. (2.188), we have 


— BE, 
——— |V,,|7 < 0. (2.189) 
Now, let 
H(a) = Hy) + av, 


and let F(«) be the free energy calculated from H(a). Then F(O) = Fy and 
F(1) = F. From the preceding perturbation analysis, 


F(a) = Fy + oF, + 0? F, + 0(a°). 
So F’(0) = F,. If we can prove that 
F"(a) < 0 for all a, (2.190) 


then F(a) will be concave downward (see Fig. 2.5) and thus it lies below the line 
F(O) + aF’(0) = Fo + aF,. From this we can conclude that 


Fey t+ Ff, = Fo + Vay 
or 
Fs Fy + (H — Hou (2.191) 


which was to be proved. 


4 FO) +aF, 


Fig. 2.5 The situation when F(«) is concave downward. 


211 Proof that F < Fp + <H — Ho>o 71 


To prove Eq. (2.190) we apply the perturbation analysis to F(a) for each «; 


that is, we write 


F(a + y) = Fo(a) + yFi(a) + y°Fi(a) + O(y%), 


where Fy(a) = F(a), and yF,(a) and y?F,(a) are as in Eqs. (2.187) through 
(2.189), with Ho replaced by H(«) and H by H(w + y) = H(a) + yV. The 


result of Eq. (2.189) is then 
F(a) < 0, 
so that 
d2 
F(a) = — F(a +) = 2F,(a) < 0. 
dy? 


7=0 


This proves Eq. (2.190) and thus completes the proof of Eq. (2.191). 


Applying this same theorem to Fy, we obtain 
Fo < F+ (-V)y. 
Combining Eqs. (2.191) and (2.192), we see that 
Fo + (Von S FS Fo + (Vu, 
A more accurate approximation F is obtained by including F,: 


B 
Fx Fo + Woo + Faves —3({ 


te) 0 


ew oY e- wHoy ‘w) : 


(2.192) 


(2.193) 


(2.194) 


CHAPTER 3 


PATH INTEGRALS 


3.1 PATH INTEGRAL FORMULATION OF THE DENSITY MATRIX 


Earlier it has been shown that the density matrix satisfies the equation 


Op(u 
h A) Sa: (3.1) 
and has the formal solution 
p(u) = e7 Bult, (3.2) 


where it will be noted that u has been redefined to be Bh, which has the dimen- 
sions of time. Furthermore, the “time” u can be broken up into divisions of 
length e(ne = u) so that p can be developed incrementally as follows: 


— He/hig~ He/h wee op He/fht 


p(u) =e e 


= p,p,°°* p,(n factors). (3.3)* 


In the coordinate representation for the density matrix p(x, x’; u) the solution is 
represented as 


p(x, x’; u) es i Fi [os Xn-13 €)P(Xn—1, Xn-23 6) ne p(X, x’; £)dx, oe AXy— 1. 
(3.4) 


The representation in Fig. 3.1 is useful in the interpretation of the above expres- 
sion. The particle travels from x’ to x through a series of intermediate steps, 
X14, X2..-,X,-1, Which define a “‘path”. The total amplitude p(x, x’, u) for the 
particle to begin at x’ and end up at x is given by a sum over all possible paths, 
that is, for all possible values of the intermediate positions x;. As the time in- 
crement é is allowed to approach zero, the number of integrations on the inter- 
mediate variables become infinite and Eq. (3.4) can be written symbolically 


p(x, x’; U) = [Joven Qx(u), (3.5) 


* The reason we want to reduce ¢ as much as possible is, of course, that the asymptotic 
behavior of p at high temperatures can be easily found (see below). 
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Fig. 3.1 A particle travelling from x’ to x along a path x,, x, etc. 


where 
®[x(u)] = lim P(X, Xn— 13 E)P(Xn— 15 Xn-25 &) °° P(%y, X'5 8), (3.6) 
and 
9x(u) = lim dx, dx,-+--dx,-4. 


noo 


Although all possible paths are to be considered in Eq. (3.5), the main contribu" 
tion comes from the limited set of paths for which none of the p(x, x’; £) is too 
small. For example, we know that for low «, 


m 2. eer 
p(x, x’; £) ~ ie ; e (m/2he)(x— x a 
TNE 


which is the density matrix for a free particle. The term p(x, x’; 2) is small if 


x~x' > / 2h/m/ &: so the main contribution comes from reasonably smooth 
paths. Now let us find ® more explicitly, first for a free particle and then for one 
Moving subject to a constraint. 


Free Particle: 


The Hamiltonian is 


H=-——. (3.7) 


From Previous results we know 


P(x, x’; 8) = i I g-tm/theyes—e') (3.8) 


2nhe 
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and 


2 

: me[ (x — x 

x, x’; U) = lim |--- | exp 4 ——— | (———"+ 
a al ac lara, 


dx, dx, dx,-1 
/2mhe/m J 2nhe|m \ 2nhe/m 


Thus, for a free particle, 


®[x(u)] = lim exp \-% (Gra freed G = ani (3.10) 
E70 &é E 


As the interval ¢ is made smaller, 


Xe Xk-1 _, dx(u) = Xu) 
é du u=ke u=ke 
and Eq. (3.10) becomes 
ae eee 
O[x(u)] = exp—— | —[xX(u)]? dus. (3.11) 
hJo 2 
Particle Moving in One Dimension in a Potential V(x): 
h? 0? 
H = —-—-~, + V(X). 
2m 0x? ) 
The equation for p is explicitly 
op h? 0? 
—-A“E = —--- 7 pt V . 3.12 
ou 2m dx? “ (op one 


In an infinitesimal “time” interval ¢ the particle may be considered as moving 
freely with a small correction due to the presence of the potential V(x). Toa very 
good approximation 


_VQ@) 


EP free- 3.13) 
h Pf ( 


bp = 


This result is demonstrated as follows: From perturbation theory we have 


Solx, x's.0) © “| ax" | poles x's — WV Ex"Wpalx', x50) MH, 3.14 
— oO Q 


3.1 Path integral formulation of the density matrix wh} 


where in our Case, Po = Pree. For low ¢, because py, is a very localized 
Gaussian, most of the contribution in the integral over x” occurs near x” = Xo, 
where it can be shown that 


ux + (6 — u)x’ 
é 


Xo = 


So we can, for small ¢, write Eq. (3.14) as 


2 


bp(x, x’; 8) & — { veo) | Ax” po(x, x"; & — u)po(x", x’; u) 
(0) — 0 


ed 
“ [ V(x X39. (3.15) 


(0) 


Now, if x ~ x’, Xo is also close to x and V(x9) is constant over the range of 
integration. Equation (3.15) goes over to Eq. (3.13): 


; € ; 
bp(x, x’; £) & — ; V(X) Ptrce(X, "5 8) 


under the assumption that |x — x’| and ¢ are small. 
Of course, as |x — x’| increases, our conclusion that f5 V(X) du & éV(x) 
becomes less accurate, and it might be thought preferable to write something like 


; ee + = Ge” a? & + _) 
2 2 


in place of eV(x). But for our purposes, Eq. (3.13) is adequate. So 


FOr Te, [ ~"@ ~ Pred(a)e VOM, (3.16) 
Thus 
ny fm. f mx — x? _ V(x) 
p(x, x';8) & Jin exp | aie P | (3.17) 


To find p(x, x’; u) it is only necessary to apply the correction e~")*/* to each 
element of the “path” of the free particle from x to x’. Thus, as ¢ > 0, 


p(x, x’; U) = [Jee 1"; i Ee + reat) au} Qx(uy — (3.18) 
0 


= {J [x(u)] Dx(u), 
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where x(0) = x’ and x(U) = x. Thus 


®[x(u)] = exp = ip kw + Veet) aul 2 (3.18a) 
0 


The free energy is derived from 


e FF — | oc: x) dx 


= | | {| exp = Is ew + Zo) iu} ax} dx 
0 x{O)=x(U)=x 


(3.19) 


where U = Bh. 

Having derived the general expression for ® the question now is, which 
“paths” are the most important? If the potential is not very dependent upon 
position, then the more direct paths contribute the most. This is because longer 
paths, with higher mean square “velocities,” [x(u)]’, make the exponent of the 
weighting functional ®[x(u)] more negative. On the other hand, suppose the 
potential does depend on position. For simplicity, let the potential increase for 
larger values of x. Then a direct path such as | in Fig. 3.2 does not necessarily 
give the largest contribution, because the integrated value of the potential is 
higher than over another path such as 2 or 3. Consider a path such as 2 that 
deviates widely from the direct path. Then V(x) decreases but x increases over 
the path. In this case we would expect that the increased velocity x would more 
than compensate for the decreased potential over the path. Thus, the most im- 
portant path would be one for which any smaller integrated value of potential 
energy is more than compensated for by an increase in kinetic energy (sym- 
bolically represented by 3 in Fig. 3.2). 


oO x’ 


Fig. 3.2 Three paths between x’ and x. 
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In the classical limit, the temperature is high and U is small. We want the 
classical partition function, and we would like to see how it is related to the 
quantum-mechanical partition function. From Eq. (3.18) 


p(x, x; U) = {J exp - iB = + rw) au} Dx(u). 
0 


x(O)=x(U)=x (3.20) 


If we assume that the most important paths are those for which V(x(u)) + V(x), 
Eg. (3.19) can be approximated by 


U « 2 
exp | -7 v0] {J exp | -; j mu) | Bx(u) 


x(0)=x(U) =x 


2 


p(x, x; U) 


i 


exp |-5 r09 | Prree(X, xX; U) 


“m —m(x — x’? — UV(x) 
J oahu | hU hi | Go) 


Equation (3.21) is just a specialization of Eq. (3.17). But from the derivation of 
Eq. (3.21) it is easy to see the direction of the error in the classical limit. We 
assumed that the most important paths had x(u) ~ x, whereas in fact the most 
important paths are those for which [§ [mx(u)?/2 + V(x(u))] du is a minimum, 
which occurs when (x()) goes over paths that tend to let V(x(w)) be lower than 
V(x). It follows, then, that 


i 


Petassical(, x; U) < p(x, x; U). (3.22) 

The classical partition function is 
e. Pre! a ° ch X, X5 U dx = ase 7 e WOK y, 3.23 
[i ex ) saul. (3.23) 


From Eq. (3.22) we obtain 
Fy 2 F. (3.24) 


To find the condition for Eq. (3.21) to be a good approximation to the path 
integral, note that the paths for which 


U 2 
: ae du> 1 
hjy 2 


do not contribute much to the path integral. Using the Cauchy-Schwarz in- 


equality, we find 
b 2 b b 
| fg < | ure | Igl’. 
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With f = x and g = 1, we have for 0 < u, < U: 


| SG Ik #0) J < ae # (3.25) 
0 ce) 0 


So for the important paths, (m/2hU)|x(u,) — x(0)|? < 1. If dis the maximum 
value of |x(u,) — x(0)| for 0 < u, < U, then the paths that contribute signifi- 
cantly satisfy 


Ix(u,) — x(0)|? = 


mkT d? 
a 


For the hydrogen atom at 1°K Eq. (3.26) yields d < 9A. For oxygen at 
1°K, d < 2.5 A; for oxygen at 100°K, d < 0.25A. 

If the potential varies so slowly that it is approximately unchanged even 
when the path varies by more than d, then Eq. (3.21) is good. Thus, for heavy 
particles at normal temperatures, systems may be treated by classical statistical 
mechanics. However, the classical approximation does not apply to liquid 
helium, nor does it apply to electrons in a metal, nor even to solids if the tem- 
perature is too low. 

A few things should be noticed about this formulation of statistical 
mechanics: 


(3.26) 


1. We have discussed only the case of one coordinate. However, the extension 


to N degrees of freedom is obvious; if x, x, ..., x are the coordinates, 
QYx(u) becomes 
Qx(u) = him dx dx'?) +--+ dx dxQ) - ++ dx$ +--+ dx, (3.27) 
n>o 


2. If “u’ is replaced by “iu” in Eq. (3.1), we have Schrédinger’s equation. Ina 
similar manner to that of statistical mechanics, quantum mechanics can be 
formulated in terms of path integrals. For mathematicians, statistical mechanics 
is easier to deal with because the exponentials in the path integrals are exponen- 
tials of real quantities. 


3. We have not dealt with relativity (in which particles appear and disappear) 
and spin (which is a discrete, rather than a continuous, variable) in the path- 
integral formulation. If such effects exist, we can only get approximations. 
Actually, spin can be put into the formalism, but our interpretation of Dx(u) as 
a set of paths differing by an infinitesimal amount no longer holds. 


3.2 CALCULATION OF PATH INTEGRALS 


We will discuss two techniques for the calculation of path integrals: 
1. Integrals whose exponents are quadratic in x and x will be calculated exactly. 
2. Perturbation expansions. 
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Consider the element 
U ° 2 
f(%2, x1, U) = [Je el mle au} Qx(u) (3.28) 
oO 


where x(0) = x, and x(U) = x,. This is the density matrix element for a free 
particle. Let us expand each path about the straight-line path in the xu-plane. 
Call the straight line path x(u). Then 


x(u) = x(u) + y(u) (3.29) 
and 
ft, %, U) = {] exp | -im [ (iu) + (0)? au] yu). (3.30) 
Now 
dx(u) _ Xz — x, _ ae . 
i f. v elocity (3.31) 
so that 
U 
| [x + yp]? du 
0 


U U 
= | (v? + 2vp + j?) du = v?U + 2w[yU) — y(0)] + | jy? du. 
0 0 


(3.32) 
But y(U) = (0) = 0. Therefore 


mv?U m (U 
X2,x,,U) = exp] —- — —_— 2 du | Dy. 3.33 
f(X2, x1, U) P| dh ial mee “| y ( ) 


Thus we have split the path integral into a factor dependent upon the endpoints 
and U, and a path integral dependent only upon U. Writing 


U 
m 2 
exp| ——- du| Gy = F(U), 
P| he |» (U) 


P(X2,xX,, U) = F(U) exp [—m(x, — x,)?/2AU ]. (3.34) 


y(0)=y(U)=0 
we get 


We can obtain an equation for F(U) by using 


pls. ys, + u;) = | Pear Rar Cane (3.35) 
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The reader should derive that equation for F(u) and show that its most 
general continuous solution is 


F(U) = Vm/2nhu &”, (3.36) 


where a is an arbitrary number. Physically, the value of « is immaterial, because 
a change in the zero of the potential does not change the physics; it does, 
however, change the value of «. As a matter of fact, since the expectation value 
of any quantity, A, is given by 

_ Tr pA 


<A> Tr p 


(3.37) 


F(U) need not be found at all. It cancels out in the division. We know that « is 
zero, however, from other methods of calculating p (see Eq. (3.8)). 
Now consider the path integral for the harmonic oscillator: 


1 (°f mx? — mow? 
,x,,U) = exp 4 — + 21 du Ox. 
Fein xn) [Teen t—g | | + A =| al os 


Let us expand each path about the path that makes the largest contribution 
to P. For this path, 


u 
| G(u) du = 


oO : 


= minimum, 


I 
2 Se 
3 
SEs 
+ 
vg 
*, 
ES) 
Nu 
iJ 


where x(0) = x,, x(U) = x. Thus, our integrand must satisfy the Euler 
equation, 


~ 


0 0G(u) ca 0G = 9) or mx <= mo?x = 0. (3.38) 
du Ox Ox 


For any path, we write 


x=xt+y 
so y(0) = 0, y(U) = 0 and the integral over u becomes 


ik i (x +p)? + me (x + | du 


U 
(x? + w?x?) du + Z| (2x) + 2w2xy) du 
i¢) 


U 
| (jy? + wy?) du. 
0 
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Now 
U 
| 2xy du = 2yx 


oO 


u u 
- [ 2xy du. 


0 Q 
Because y(0) = 0 and y(U) = 0, we have 
U U 
| xp + w?xy) du = [ 2y(—% + w?x) du = 0, 
0 0 


gince x satisfies the Euler equation. Thus, by expanding about the classical path, 
we have eliminated the integrand linear in the deviation. Thus we may write 


U 2 
(3 x? 4 F “) au] F(U), 


FOS ep Ey " 


(1) 


F(U) = {| exp E iA (5 yt me ”) au] Dy(u), 
0 


with y(0) = Oand y(U) = 0, is a factor independent of the endpoints x, and x). 
Thus expansion about the classical path allows us to separate the path integral 
into a factor dependent upon U and the endpoints, and another path integral 
dependent upon U alone. Thus the important dependence on x, and x, can be 
found by simply solving the minimizing differential equations (Eq. 3.38) subject 
to the end conditions x(0) = x, and x(U) = x, and calculating the integral 


u 2 
m x2 4 MO" 62) du. 
ND 2 


In fact even this integral can be simplified. We have 


u u u 
| ™ 22 du =" xx a xx du. 
o 2 2 jo 2 Jo 


U U 
+ | (-¥ + wx) du = xi} * 
re: 2 


where 


Therefore 


U U 
| m (x? + wx?) du = ™ xi 
o 2 2 0 ) 


As a new problem, suppose we have two interacting systems, one of which is 
* harmonic oscillator. Let x describe the position of the oscillator and q that of 
the other system. A possible Hamiltonian is 
<2 2 42 
H = ™® 4 ne x2 4 MG 


V(q) — yxq. 
, - ge ae 


* You can compare your result with Eq. (2.83). 
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The partition function is 


1 (8 (mx? | m@?x? 
Q= {| xp | —; { ( 5 + aa 7a(wx(o) au] 


U 22 
x exp E | Gs + v(a)] au] Dx Da. 
hj, \ 2 


We cannot, in general, evaluate this path integral, but we can integrate out 
the x variable. To do so we must find 


F[q()] = {| exp E {, i oa max = vax) au] Ox. 
ie) 


This integral can be evaluated by the same method as that used for a har- 
monic oscillator. More generally, we can evaluate 


F,[f;x,x’] = {| exp E {, (" + max 4+ viwxtu) au] 9x. 
0 


x(O)=x 
x(U)=x’ 
The result is 
Fi(f; x, x’) = Vmo)2nh sinh oU e~%*, (3.39) 
where 
1 aol tas : mo 
0 = —— e~°l¥—"'le(y) f(u’) du du’ + ————— 
4mw \ i FOFw) 2 sinh wU 
x [(x? + x’*) cosh @U — 2xx' + 2A(xe®? — x’) + 2B(x'e?” — x) 
+ (A? + Be — 2AB], (3.40) 
where 
i U 
A = —— e °“f(u) du, 
2m@ Jo 
i U 
B= — | e7~°-F(y) du. (3.41) 
2m@ jo 
Now let 
‘ U 
&[f] = (exp Ei f(u)x(u) iu) 
h (0) harm, osc. 
fa f Ff; x, x) dx (3.42) 


. § F,(0; x, x) dx” 
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Eventually one will arrive at the result: 


we YU CY cosh w(|u — u’| — 3U) : : 
€[f] = exp | rane |. sinh JoU f(u)f(u’) du du } 


(3.43) 


If we define f outside the interval 0 < u < U by letting f be periodic with 
period U, we may write 


€[f] = exp | ~ amon { is ene" lf(u) f(u') du iu’). (3.43a) 


An alternative way of finding F,(/; x, x) is to Fourier analyze f(u) and x(u): 


+ b, sin 


Sa cack ¥ (« cos 2mne : oat 


S(u) = fo + 3 (eos + ans sin no). 


The path integral 
FiLf] = { F,(fsx, ) dx 


can then be written as 


Ff] = We [dao day +> a db,-:: 
x exp E [, (S + mar 3? + iw) | du 


and can easily be evaluated.* 


* The equation 


cos nO _ 2 cosh (x — |O|)a 
2a a=1 nm? + a* 2a _— sinhza 


is useful. F, [f]as found by the above method has an unknown multiplicative constant 
as a consequence of the Jacobian for the change of variables implicit in Dx(u) > 
da, --- db, db, +++; but Eq. (3.43) can be found unambiguously by the above method. 
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For a free particle, we can find F,[f] either by letting w > 0 or by direct 
calculation. The result is 


{| exp i; i [4mx(u)? + if(u)x(u)] au Dx(u) 
1) 


= J nim {(- flu) au] exp ‘anh ({" {, Ju — w'Lf(u)f(u’) du du’ 
2 U 2 
+= | uf (u) au] ) ; (3.44) 
Ul Jo 


3.3 PATH INTEGRALS BY PERTURBATION EXPANSION 


We will make use of this last result later. 


Suppose that we have found the density matrix element po(x2, x,, U) for some 
system of potential V,, and we wish to know (x2, x,;, U) for another system with 
potential Vy + V’, where V’ « Vo. Then 


p(X2,%1,U) = {| exp \-j iE "S + Vo(x) + | au] ax| 
0 
- _1 (*[ mx? 
SP es 
1 U 
x exp -; | V(x) au} 9g, 
Expanding the second exponential, we have i 
U v2 
= {| exp =; j ie + v0 | au} 
1 U 1 U z 
x ! - Al V'[x(u)] du + ral V'[x(u)] iu) + «| 2s 
ho 2h?\ Io 


U 1 “2 
Sal) ta ea el 


x exp 1; {. a zy vee | aw'| V'[x(u)] du Dx + °° 


1 ¥ 1 ("7 mx? ; 
Po — ‘|, du [Je ak ee + v9 | aw’ 
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O xy 


Fig. 3.3 Integrand of first-order term 


In order to calculate the path integral, we first integrate over all paths for which 
x(u) = y (with x(0) = x,, x(U) = x2) and then integrate over y. 


U 
P(X2,%1U) = po — | | PolX2,¥, U ~ u)V(y) poly, x1, 4) dy du + °°: 
0 ally 

Pictorially, the integrand may be represented as shown in Fig. 3.3. 

We can think of the zeroth-order term, po, as being the contribution to the 
density matrix that results from the particle going from x, to x, in “time” U 
without being affected by the potential. Then the first-order term, which has its 
integrand represented in the Fig. 3.3, may be thought of as the contribution 
resulting from the particle being essentially free, but being “scattered” once at 
some “time” u and some position y. Similarly, the second-order term can be 
considered as the contribution from two “scatterings”, and can be written as 


1 U u (a2) roe) 
vil iu | iw | | dz 
h Oo 0 -o@ -2 
X PolX2,2,U — u)V(z)po(z, y,u — v)V(y)poly, x1, v) 
Pictorially, this integrand may be represented as shown in Fig. 3.4. 


Notice that we have not found any new results. We have already derived 
this perturbation expansion without the use of path integrals. 


Problem: Find to what order the nth-order integral expansion of p(x2, x,, U) 
Solves 


op h? Op 
eS ee ee Va ea: : 
au am ax? [Vo(x) (x)]p 
knowing that 
0 h? @? 
— Po = — 2 CPO + Volx)Po- 


Gu 2m 6x? 
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(x9, U) 


| 


——_ 


Fig. 3.4 Integrand of second-order term. 


3.4 VARIATIONAL PRINCIPLE FOR THE PATH INTEGRAL 
For the free energy we know the relation 

F< Fi +H —- Ho)o, (3.45a) 
or 

F < (H)o — TSo. (3.45b) 


This is written in terms of the Hamiltonians H and Hy. We ask ourselves if we 
can write a corresponding minimum relation using the path integral. 
In the path-integral form, the partition function is written as 


e7 FF = e OA x(u). (3.46) 
x()=x(U) 


Suppose we have another “S”’, So, that is easier to work with. (Here Sy does not 
mean entropy). Then Eq. (3.46) may be written as 


~(S-So)y-S. 
-pr _ le Se" Dx o~PFo 
{J e-%° Dx : 


e FFo = {| e °° Dx, (3.48) 


The first factor of Eq. (3.47) has the form of an average of exp [—(S — So)] 
with e~ °° as the weight for a certain path x(u). We may write Eq. (3.47) as 


COPE =e Ler 80) coe” OFS, (3.49) 


e 


(3.47) 


where 


Now we suppose that S and S, are real and use the inequality 
eS ae, (3.50) 
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i 

| 

J 

i} 

1 

! 
(f) fy 
Fig. 3.5 Geometrical interpretation of (e~*) = e~. 


The geometrical interpretation of this relation is as shown in Fig. 3.5. In Fig. 3.1, 
<e~5) is always above the curve e~4. Note that Eq. (3.50) does not depend on 
how the /’s are distributed. When we apply Eq. (3.50) to Eq. (3.49) we can write 


; e7 PF > @~ S~So 9 BFo. (3.51) 
where 
S — Sy)e~* Dx 
Siege ie 7 =e (3.52) 


Usually Eq. (3.52) is simpler to calculate than the first factor in Eq. (3.47). We 
‘thus have a theorem: 


F<Fot+ 5 — So>s,: (3.53) 
Exercise: When 
_ {i ee (u) 4 reatw) | di: 
Ee (3.54) 


So = { , Ee + reetw) du, 
0 


4 SS Veta: (3.55) 


show that 


Thus Eq. (3.53) contains Eq. (3.45) as a special case. (Hint: from Eq. (3.54)), 
B 

S—S)= { (V — Vo) du. (3.56) 
0 


The numerator of the right-hand side of Eq. (3.52) becomes an integral over u, 
the integrand of which is independent of w. 


88 Path integrals 


3.5 AN APPLICATION OF THE VARIATION THEOREM 


Finally in this chapter let us explore an application of our variation theorem, 


Eq. (3.53). 
In a one-dimensional, one-particle case, we have 
ga! {, aw + v(x) du, (3.57) 
where 
U = Ba. (3.58) 


To avoid unnecessary clutter, we will set # equal to unity in most of the following 
work. Every once in a while we will restore the #’s, so that the reader can see 
where they belong. 

The partition function may be written as 


on oF = pew \- {, ia + v(x0o)| aul Qx(u) dx(0). (3.59) 
0 


Here, we first fix x(0) and x(U) = x(0), and then do the path integral over all 
paths; then we vary x(0). The zero in $ indicates that we integrate over x(0). 
In the classical limit of high temperature, or small #, U is small, so that the 
path does not deviate much from the initial point x(0). Thus the first approxima- 
tion in Eq. (3.59) is to replace V(x(u)) by the initial value V(x(0)) to obtain 


~pF fmt | eVOONT dy. (3.60) 
TT 


This is the well-known classical result. 

How can we improve on this approximation to take the quantum effect into 
account? 

First we observe that, because x(0) and x(U) are equal, it appears more 
natural to use an average 


edie: 
=F [, x(u) du (3.61) 


in place of x(0). Second, the deviation of the path from the classical straight line 
may be taken into account by some average of V(x) over the path rather than by 
a constant V(x(0)). These considerations lead to the following trial So: 


Se , muy” ay 4 Uw(x). (3.62) 
a 2 
Here w(x) is a still undetermined function, which is to be varied later to minimize 


Eq. (3.53). In the first approximation one would expect to choose w(x) = V(x) 
but we can do much better with our variational principle. 
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en PFo = {| Bx exp |-|. "s + Umco iu] 
oO 


all closed paths 


= fa {| exp Pal " du + Uw(a | au] Dx 


all closed paths x 
with fixed average x 


Now let y = x — x(0). Since p(u) = X(u), we obtain 


Uy 39 
e FFo — [a ee? {I exp |-[ si Dy. 
0 


all closed paths 
y with »(0) = 0 


In the path integral above, there is no restriction on y because y = x — x(0), 
and we integrate over x(0). The reasons for integrating over the paths chosen can 
be made clearer by drawing diagrams; the reader should do this if he needs 
clarification. 


ee = pdxe Op... (0.030. = fs * 9 UO dy, 3.63 
| Pend; 0) = [= (" etPdy. 3.6) 


It should be noted here that in the classical case Eq. (3.60), w(y) is replaced by 


V(y). 
Next, we calculate (S — S,>. Using Eqs. (3.57) and (3.62), we see by 
definition 


5S ~ So) 
— {J 1/A(S — So) exp [—J mx?/2 du] exp [— Uw(x)] Bx 
{] exp [—J mx?/2 du] exp [—Uw(x)] 9x 


— {a/U fo Vix(u’)) du’ — w(x)} exp [—f mx?/2 du] exp [-Uw(x)] Dx 
{J exp [- j mx?/2 du] exp [—Uw(x)] 9x 


(3.64) 


Here x is defined in Eq. (3.61) and is functional of x(u). The first term in the 
humerator is simplified as follows: 


(lz [, V(x(u')) du’ exp |- {, mit au exp [—Uw(x)] Dx 
-(4 ctl V(x(u’)) exp | ~ is mi(u)’ au exp [—Uw(x)] 2x 
ls du Je V(x’) {] oe mF au [a] 


ay, x(U) 
=x’ 


x exp [—Uw(x)] Zx(u). 
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We are integrating over paths like those in Fig. 3.6a. By making the change of 
variables 


we integrate over paths like those in Fig. 3.6b. Because of the condition that 
x(u = 0) = x(u = U), we do not get a discontinuity in x at V= U— wu’; 


therefore 
U 2 U «2. 
mx*(u) Hips mx*(V) av. 
o 2 o 2 


Also, 


Ltr Ee V) av 
=5 |, au = 7 x(V) dV. 
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The first term in the numerator of Eq. (3.64) becomes 


[, du! favre {J exp Ee me) av | exp [—Uw(x)] 
0 U . 


x(0)=x(U)=x’ 


= {| V(x(0)) exp |-[ mo iu] exp [—Uw(x)] Ox(u). (3.65) 


x(0)=x(U) 


Thus Eq. (3.64) may be written as 
4S = Sosy = <V(x(0))> — <ul). (3.66) 


To transform further we use the following result (proved previously): 


{] exp cE (lee: m3(u)? au] exp EA Sf(u)x(u) iu] Dx(u) 
pm ((" fu) iu) exp tam I, {, ju — wh f(u)f(u’) du du’ 
0 4m Jo Jo 
2 U 2 
+ ({ uf (u) au) } (3.67) 
U \Jo 


In evaluating Eq. (3.65) we first do the path integral keeping x fixed; then 
Eq. (3.65) is written 


{| V(x(0)) exp Bl matey au] exp [—Uw(x)] Dx 


= [xo exp[—Uw(y)] dy (3.68) 


where 
K(y) = {| V(x(0)) exp |- { : matey | 9x (3.69) 
0 


Such that x = y. Equation (3.69) may be written 


K(y) = {| V (x(0)) exp ee mtu au] d(x — y) Dx(u). (3.70) 
0 


For this we use the Fourier transform: 


V(x) = [eae dq (3.71) 
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to write Eq. (3.70) as 


K(y) = fo dq {ae {| exp [igx(0)] exp [ik(x — y)] 


U 2 
x exp |-| oor au Qx. (3.72) 
0 


Here 5(x — y) of Eq. (3.70) was also Fourier transformed, and we are ignoring 
factors such as 2z. Use Eq. (3.61) in Eq. (3.72) to write 


K(y) = [ow dq {ae exp [—iky] {| exp [igx(0)] exp } x I x(u) au] 


10) 
U «2 
x exp ai = au] Ox. (3.73) 
19) 


Now we see that the path-integral part of this expression can be brought into the 
form of Eq. (3.67) if we define 


Or Cee . (3.74) 
because 
U k U 
| f(@)x(u) du = qx(0) + al x(u) du. (3.75) 
0 U 1) 
When we use Eq. (3.74) in Eq. (3.67) we see that 
[,4@ du=q+k, (3.76) 
0 
is Pye cc (3.77) 
P 2 


{, {, lu — u’|f(u)f(u’) du du’ 2] du i, (u — u’)f(u'’) du'f(u) 
o Jo 0 0 


U u k 
2| du [| (u — u’)— du’ + ug | se 
0 0 U 


e ku? k?U 
2) du|— + = —— + kqv. 
| u Ee a] f(u) 3 q 


0 


(3.78) 
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Use Eqs. (3.76), (3.77), and (3.78) in Eq. (3.67) and also in the path-integral part 
of Eq. (3.73) to write the latter as 


: U (k? k? 
K(y) = [em [a exp [—iky]6(q + k) exp FE & + kq + | 


k?U 
= [dave [ax exp [—iky]5(q + k) op| —Fa| 


= [eer exp [ —iky] exp | . (3.79) 
m 


Using the inverse Fourier transform, we find that Eq. (3.79) becomes 


K(y) = [a exp [—iky] [ve exp [ikz] dz exp Ea 


= | deve) en] Fo - 2]. (3.80) 


When the correct factors are taken into account, the final form is 


K(y) = oe | ee) exp | ~ST 2). (3.81) 


It should be noticed that K(y) is V(z) averaged over a Gaussian. The root mean 
square of the Gaussian spread is 


ast Bh. 3 (3.82) 


V12 mkT~ 


This spread is about 1 A for a helium atom at 2°K, and is a very narrow spread 
atroom temperatures. At the limit of infinite temperature, the Gaussian becomes 
a delta function, and K(y) > V(y). 

Summarizing the results obtained so far, we have 


F<Fy+ ; (Sass (3.83a) 
-BFo 7 Es. 
e [exe C-onen) ay |. (3.83b) 
leg 55, = LAO)— wo) ON ay (3.83¢) 


B je frm dy 
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Equation (3.83b) is Eq. (3.63). Equation (3.83c) follows from Eqs. (3.64), (3.65), 
and (3.68). Further, K(y) in Eq. (3.83c) is given in Eq. (3.81). 


Now we ask what is the best choice of w(y)? Changing 


wy) > wy) + n(y) (3.84) 
where n(y) is small, we see from Eqs. (3.83b) and (c) 


SP ea ea (3.85a) 
= fe ™*[—-Bn({K — w) — n dy] ees — w) dy Bne~*” dy 

o4<s So = Te dy (eo ayy 

a (3.85b) 
uS 
5 (Fe + 5 fi S.)s) ~0 (3.86) 
leads to 

w(y) = K(y). (3.87) 


This is the best choice of w(y). 
In this case Eq. (3.83c) yields 


‘Ss - Soso = 0, (3.88) 


so that, from Eq. (3.83a), 
F < Fux (3.89) 


where F., x means the classical free energy with the potential V(y) replaced by 


K(y). Alternatively, 
g Phere C= e BK) 3.90) 
2nhU ® ( 


compare Eq. (3.60). K(y) is defined in Eq. (3.81). Fy x is a better approxima- 
tion than the ordinary classical form F,, in Eq. (3.60). 


Example: To find out how good Fy, x is, let us consider the harmonic oscillator. 
In this case the potential is 


252 
V(x) =m oes (3.91) 
From Eq. (3.81) 


2 2 2 
K(y) = [omit aan Ea exp| (y - | dz Gi + Fe). 
TT 
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Using this result in Eq. (3.90), we obtain 


2 2 
~BFclK _ m _ p mo 2 WB d 
: Jay fe[ a (» + om) | ‘ 


= (1/ABo)e 00/24, (3.93) 


Thus, we have the following results: 


Fog 7 In E sinh ("3") ; (3.94a) 
B 2 
™ 1 h? B?w? 
Fax = F [ (Aw@p) + vi | (3.94b) 
Fy = : In (tw). (3.94c) 
When we write 
G= rae and f= a (3.95) 
Eq. (3.94) becomes 
Girue = ; In (2 sinh f), (3.95a) 
2 
Gak = , In (2f) +f ; (3.95b) 
Gy = ; In (2f). (3.95c) 


Table 3.1 is a numerical comparison of Gyue, Gey x and Gy. This table shows 
the remarkable improvement Go, x has over Gy. Remember that f = 4 cor- 
Tesponds to the temperature kT = fw where the quantum effect is large. 


Table 3.1 

f= 1/2 f= f= 
Girue 0.08263 0.8544 0.9908 
Gack 0.08333 0.8598 1.0264 


Ga 0 0.6931 0.6931 
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The expression F,, , in Eq. (3.90) combined with Eq. (3.81) is better than the 
series expansion of K(y) in terms of V(y), d?V/dy?,.... 

Nevertheless, Fx is not as useful as Table 3.1 might indicate. First, it 
cannot be used when quantum-mechanical exchange effects exist. Second, it 
fails in its present form when the potential V(y) has a very large derivative as in 
the case of hard-sphere interatomic potential. 


CHAPTER 4 


CLASSICAL SYSTEM OF N PARTICLES 


4.1. INTRODUCTION 


The classical partition function of N interacting particles is written as 
3N/2 
ePF - 1 f_™_ Zys (4.1) 
N! \2nh?B 


Zy = ere ena tk, d3R, ++: d3Ry. (4.2) 


where 


These equations can be derived from the quantum-mechanical results we have 
obtained previously. From Chapter 2, Section 8, we have 


e FFs = Hip | PoRare oo Kas Pas PX) dX,-+:dXy 
: P 
for Bose statistics. For Fermi-Dirac statistics, 
e-PFa wn wi ¥ (=1)?o(X1,--., Xy3 PXy,.-., PXy) dX, °°: dXy. 
. P 


In either case, only the identity permutation is important, because at 
sufficiently high temperatures, factors such as e~™*7/2#(X—PX)? Kill the other 
terms. It follows that: 


ae) a Py(X1,-.--, Xy3 Xq,---, Xy) dX, +++ dXy. 

We have already estimated this integral for the case N = 1 by the use of 
Path integrals (see Chapter 3, Eq. (3.60)). Generalizing the method used to the 
case of N particles in three dimensions, we arrive at Eqs. (4.1) and (4.2). 

When a gas particle is a polyatomic molecule, the internal motion of a 
molecule and the motion of the center of gravity can be separated; then the 
Specific heat is a sum of the two contributions. In Eq. (4.2), R; is regarded as 
the center of gravity of the ith particle; the partition function for the internal 
Motion is not included here. The latter can be calculated using information 
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about the energy levels obtained from the infrared spectra, or calculated using 
quantum mechanics. 

For a dense system such as a liquid the internal motion of polyatomic mole- 
cules and the motion of center of gravity are entangled with each other, and the 
two are hard to separate. 

The following discussion excludes the internal motion of a particle, and the 
system is regarded as a gas or a liquid of inert gas particles. 

The free energy is written from Eq. (4.1) as 


3N ,_ 2nh? N,N 
P= ing + in -FinZy + 7X, (4.3) 
2B 2p m B B 


The e in the last term of Eq. (4.3) is the base of natural logarithms, rather than 
the electronic charge. This e appears because we used the Stirling formula: 


N ——— 
Nix (7) J2nN >In N! w N (i *). (4.4) 
e e 


From Eq. (4.3), we have for the internal energy U 


2 
y = - 7? SED) «ger 4 BO Sy, (4.5) 


where 
R = Nk. (4.6) 


The equation of state is found to be 


PV _ V_aZy 
RT NZy 6V’ 


(4.7) 
from Eq. (1.6). 


Example: For an ideal gas, V(R,,..., Ry) = 0; therefore Zy = V™ and 
Eq. (4.7) becomes 
PV = RT. (4.8) 


When we assume that the potential energy can be written as a sum of pair- 
wise potentials: 
V(R,R2°°* Ry) = YY (ru) (4.9) 


J 
pair 


Eq. (4.2) can be written 


Zy = fee E > viru) | aR, d?R,+--d>Ry. (4.10) 
pair 
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—os 
% 


Fig. 4.1. An unrealistic potential; vp is the volume of a cluster of N particles. 


Note that this expression contains all the information about the phase transitions 
of the system. In other words, the nature of the phase transitions can, in principle, 
be discussed by examining the behavior of the integral in Eq. (4.10) purely 
mathematically, without physical knowledge. 

Equation (4.10) suggests that when the number of particles N is finite there 
is no discontinuity in physical qualities. Only when N > co do we expect 
discontinuities, and this is the case we are interested in. 

The assumption of a central force potential V(r;;) in Eq. (4.9) is not exactly 
justified. The necessity of a noncentral force is illustrated by the following 
observation: In the solid state, if we assume the nearest-neighbor interaction 
only, the face-centered cubic and the hexagonal close-packed structures are not 
distinguishable. When the second nearest neighbors are taken into account the 
central-force assumption leads to the conclusion that hexagonal close packing 
has the lower energy, whereas face-centered cubic is actually more stable for 
solid argon for example, which is inert in its gaseous phase. 

The free energy F of Eq. (4.3) is not necessarily a function of V/N only. 
If we take an unrealistic potential like the one in Fig. 4.1, the energy of the 
System is roughly of the form CN?, where C is a constant and Nis the number of 
Particles forming a cluster of volume vp. Thus, we can think of two extreme 
forms of Zy: 

ZO) = e~BNPCyN 
Dee ee (4.11) 
Zo eV, 
When the temperature is low, for large N, the contribution of ZY can be larger 
than Z?); then Fis not a function of V/N only. 
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4.2 THE SECOND VIRIAL COEFFICIENT 


When the gas is not dense, the data of the equation of state are summarized 1) 
practice as 


t+o+—S+- (4.12 
V 


where B is called the second virial coefficient, and C the third virial coefficient 
In this section we will derive an expression for B. We assume a pairwise potentiz 
as in Section 4.1. 

Consider a system of N + 1 particles. In this case, Eq. (4.10) of Section 4, 
becomes 


2 = [ow] -#y V(X — a) | xp] ~p og ve| d?>X d°*R. (4.12 


When we integrate over X first, all the R,’s are fixed. If the gas is not dense 
the volume for which V(X — R;) is appreciably different from zero is small; s 
we May approximate 


Zy+1 & VZy. (4.14 
Equation (4.14) will lead to the ideal-gas equation of state. To obtain a bette 
estimate of Zy,,, we write Eq. (4.13) as 
_ fexp[-B ds V(X — Rj exp [-B Lpair Virij)] d°X a PNR 
j exp L=B Pai: V (r;,)]d°*R 


x [ow |-6 > ver] d?XR, (4.15) 


pair 


Zy+i 


The ratio part of this equation has the form of the weighted average of 
[oe [-BYV(x ~ Ry] d?x. 


We will evaluate this average by neglecting the three-body and higher-order 
collisions. When X is close to R;, this pair is assumed to be far away from any of 
the rest of the R;. This assumption is equivalent to replacing the weight factor 


exp |-# Dy vero | 


by unity (see Fig. 4.2). When X approaches R; in Fig. 4.2(b), we have a three- 
body collision among X, R;, and R;. Therefore, we neglect the configuration (b). 
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Weight: 1 Weight: e BV, ,) 
(a) (b) 


Fig. 4.2, Weighting factors in the derivation of the ideal gas equation of state. 


Thus, when we neglect the three-body and higher-order collisions, we can 
approximate Eq. (4.15) as 


~ Jexp[-B 5, V(X — R)] d?X d?®R 


Zye1 [aR Zy (4.16a) 
_ [2X [f exp [Bex — RY aR 7. = (A166) 
Now, we make the transformation: 
femora =V- fu — @ BVI RY) dR, (4.17) 
When we write the second term as a, 
a= is [1 — eA’ ]4nr? dr. (4.18) 
0 
Using Eqs. (4.17) and (4.18), we write Eq. (4.16b) as 
Pees: re Zy. (4.19) 


This is a recurrence relation for Zy. We can solve for Zy from Eq. (4.19) as 


follows: 
a N-1 
Zn = v(t — r) Zy-1> 


a\N-2 
V(lt- = Zy-25 4.20 
re a0 


N 
z 

I 

ll 
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Multiplying all the equations in Eq. (4.20), we have 
a (N~1)+(N-2)40°°41 a N(N-1)/2 
Zy =V*{1-— =V*(1—— . (4.21) 
V V 


When N is very large, it can be approximated as 
Ze Verte, (4.22) 


This exponential part is interpreted physically to mean that each pair contributes 
a factor e~ @/¥), 
The equation of state is derived from Eqs. (4.22), and (4.7): 


2 
Py _Vélndy_V 8 (yyy — N2a\ (4.23) 
RT N OV N oV 
or 
PV/RT = 1 + Nal2V A) 


Comparing this with Eq. (4.12), we obtain the expression for the second virial 
coefficient: 


B = Na/2. (4.25) 


This is an exact result, in the sense that the three-body and higher-order col- 
lisions that we have neglected have effect only for the third virial coefficient and 
higher-order terms of Eq. (4.12). 


Example: Let us consider a potential as shown in Fig. 4.3. Then Eq. (4.18) is 
evaluated as 


b roe) 
a= | (1 — 0)4nr? dr + | [1 — ef? ]4nr? dr. (4.27) 
0 b 


4 vir) 


$7) 


Fig. 4.3. A hypothetical potential. 
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When the temperature is high we may approximate Eq. (4.27) as 


4nb? 1 f° 
fi eer r)4nr? dr, 4,28 
3 KF |, pr) (4.28) 
which we may write in the abbreviated form: 
D 
a=%- 7. (4.29) 
When we use this in Eq. (4.24), 
A a eel (4.30) 
RT 2V 2VT 


We can find out if this expression fits experimental data well by comparing 
it with the Van der Waals equation of state, which is a good fit to the data. The 
Van der Waals equation: 


(P + a,/V7)(V — b,) = RT, (4.31) 
can be expanded as 
P mas 2 
PV yy bun Ql RT Be ‘ica 
RT V V 
If we set NV,/2 = b,, and ND/2 = a,/R, we arrive at Eq. (4.30) up to the 
second virial coefficient. If the Van der Waals equation is valid, it gives a second 
virial coefficient of 


B,, = b, — a,/RT. (4.33) 
A special feature of this expression is that, because a, is positive, B, 


becomes negative for low temperatures. Experimental data for helium are 
drawn in Fig. 4.4. The slight decrease for high temperatures does not follow 


—» d 


oy 
Y 
~ 


23° K 


Fig. 4.4 Experimental data for helium gas. 
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from Eqs. (4.33) or (4.30). It must be remembered that Eq. (4.30) is based on the 
hard-core assumption of Fig. 4.3. In actuality, the potential V(r) in Eq. (4.18), 
which is accurate, is slightly soft inside r = b, so that for high temperatures, 
when particles are hitting each other hard, they can come slightly coser than 8. 
Thus there is a decrease in the ‘“‘hard-core”’ radius, and hence a decrease in B. 


Numerical results: When we use the Slater potential 

V(r) =e“ — ©,/r°, (4.34) 
or the Lennard-Jones potential 

Vir) = A’/r" — B'/r®, (4.35) 


in Eq. (4.18) we can calculate a. Incidentally, in Eqs. (4.34) and (4.35), the r~° 
term comes from the Van der Waals attraction. 
Table 4.1 lists the numerical results for helium gas (in units of cc/mol). 


Table 4.1 

T°’K *Beatc (CL.) Boos TBeatc (Q.M.) 
350 10.80 11.60 10.82 
250 11.34 11.95 11.16 
100 10.75 10.95 

35 4.80 4.44 

20 —6.95 — 4.00 —5.14 

15 — 20.0 —14.0 —15.1 


* The Slater potential is used. J. G. Kirkwood, Phys. Rev. 37, 832 (1931). 
+ J. G. Kirkwood, Phys. Zs. 33, 39 (1932). 


The quantum-exchange effect on the second virial coefficient was neglected 
in this section. This effect becomes important only for temperatures below 
about 1°K. 

The general references on the subject of the second virial coefficient are: 
J. de Boer, Report on Progress in Physics, Vol. 2, p. 305 (1948-1949). Mayer- 
Mayer, Statistical Mechanics (John Wiley & Sons, Inc. New York, 1940). 


Problem: For a mixture of two gases with atomic fractions XY, and X>, such that 
X,+ X, = 1, 
show that the second virial coefficient is 


B = B,,X?2 + 2B,.X,X> + Bo2X3. 
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43 MAYER CLUSTER EXPANSION 


From Eq. (4.2), we know that the classical partition function for an N-particle 
jnteracting gas is 


zm 1 (mkT\3X? 

BF _ 

e N! (oa) 4x oe) 
Zy = demain pees Re) d3R, d3Ry +++ d3Ry. (4.37) 


The equation of state can be written, as in Eq. (4.12), as 


Ay rea eee ee (4.38) 
RT vv? 

This is the virial expansion; as before, B is the second virial coefficient, C the 
third, and so on. 

This formal expansion can, in principle, give all the virial coefficients. It 
was originally hoped that this expansion would yield the triple point, and 
related phenomena. This was not to be. For reasons to be discussed later, the 
formal program, which we now review, has not proved practical. 

In the following work, it is implicitly recognized that the /th virial coefficient 
is due to /-fold clusters. 


V(R,, Rz,..., Ry) can be written (we assume) as ) V(r;;). 


1 [ee [-B ¥ Virjy)] d*R. (4.39) 
Let 
Wy(R1, Ro,...Ry) = exp [-B ¥ V(r;,)]. (4.40) 
If there is one particle, then 
W,(R,) = e789 = 1. (4.41) 


For two particles, 
W,(R,, Ro) = ere 
W3(Ri, Ro, R3) = exp {—B[V(ry2) + V(ro3) + V(ri3) J}, ete. (4.42) 
Let 
W,(R,) = U,(R,). (4.43) 


If there are N points characterized by W,(R,,..., Ry) and if points 1 to M@ 
are far from points M+ 1 to N, Wy(R,,..., Ry) © Wy(R,,..., Ruy) x 
Wal Regis: cee. For example, if N = 2 and R, is far from R, 
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W,(R,, R2) ~ U,(R,)U,(R2). If the points are close to each other, this approx- 
imate equation must be corrected by the addition of a quantity, called U,(R,, R,), 
that is large only when R, and R, are close. 

Thus W,(R,, R,) can be written as 


W,(R,, Ro) = U,(Ry)U,(R2) + UA(R;, R2). (4.44) 
Similarly, W3(R,, R,, R3) can be written as 
W3(R1, Rz, R3) = U(R,)U(R2)U(R3) 
+ [U2(R1, R2)U\(R3) + U2(Ro, R3)U(R1) 
+ U,(R1, R3)U,(R2)] 
+ U(R,, R,, R3). (4.45) 


U;(R,, R2, R3) is the contribution due to all three particles close to one 
another; U;(R,, R,, R3) ~ 0 unless the particles are “‘clustered.”’ 


Wy(R1; R, sey Ry) = U(R;) see U(Ry) + wey + UMAR, R,, sang Ry). (4.46) 


There is another way of expressing the above equations. If we set U,(R;, Rj) = 
Sip then e8V") = 1 + f,, and, from Eq. (4.40), 


Wy(Ry, Ry,..-, Ry) = [] (1 + fi). (4.47) 


i<j 
For example, if N = 3, 
W3(Ry, Ro, R3) = 1 + fio this thes + fiehe3 + fishes 
+ fiofis + firofoshis- 


We can express each individual term in this sum as a graph consisting of 
points and lines. If N = 3, then 


3 3 3 3 
hs Sy fa aes fife FioPida er Ly: 
1 2 1 2 1 2 1 2 


If G represents a graph, and W(G) is the integral over all space of its cor- 
responding function, then 


Zy = ¥ WG), (4.48) 
G 


where the sum is taken over all possible graphs with N points. For disconnected 
graphs, such as 
5 3 
1 2 
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W(G) is a product of its disconnected parts. A cluster of N points is represented 
by a sum of all possible graphs in which those points are connected. For example: 


3 3 3 


3 
U3(R,, Ro, R3) = ex + 7% at + L\. 
2 


12 12 #1 1 2 


The Uy can also be expressed in terms of the W’s. From Eq. (4.43) U,(R,) = 
W,(R,). From Eq. (4.43) and (4.44), 


U,(R;, R,) = W(R,, R,) - W,(R,)W,(R2). (4.49) 
From Eqs. (4.45) and (4.49), 
U3(R;, R,, R;) = W3(R,, R, R;) as Dy WA(R,, R,)W,(R3) 
+ 2W,(R,)W2(R2)W3(R3), (4.50) 
where 
> WR, R,)W,(R3) = W2(Ri, R,)W,(R3) a W,(R2, R3)W,(R,) 
+ W,(R,, R3)W,(R2). 
In general, 
U(R') = U(R,,...,R)W(R,,-.-,R) — y W,(Ri, R,)W,- (Ri vip sg dey et 
+ (-1) 1 — 1)! W,(R,) ++ Wy (RD, (4.51) 
Here the coefficient for a term with k groups is (—1)*"*(k — 1)!. 


Thus, from Eq. (4.40), all the Wy are known. From Eg. (4.51), U(R') in 
terms of the W, is given; thus the U,(R') can be found. (4.52) 


In any particular partition of the N particles, there will be m, clusters of 
size 1; and of course 3°, Im, = N. 
For example (as is evident from Eq. (4.45)), a three-particle gas can be 
Partitioned into 
1. 1 cluster of 3, m,; = 1,m, =m, = 0. 
2. 3 clusters of 1, m, = 3,m, = m,; = 0. 
3. 1 cluster of 2 and 1 cluster of 1 (this can happen 3 ways), m, = m, = 1, 
m;, = 0. 
From the above discussion, with Eqs. (4.45) and (4.46), 
Wx(R®) = Sony T] U«RD ¥ Im, = N. (4.53) 


Sim, Means the sum over all possible partitions of N, that is, all divisions of 
Particles into groups. 
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From Eqs. (4.37), (4.39), and (4.40), 


= ON as | Magn 
Xy= Pag er a™R. (4.54) 
Now construct the expression 
1 


This equation is known as Mayer’s cluster integral. From Eqs. (4.51), (4.53), and 
the statement labeled (4.52), 


1 
Xy = i Scns 11 | U<R') 


1 ae 
ayy Stony iH (Vl by). 


But if N = 5, for example, U(R,R,)U(R3, R4)U(RS) gives the same contri- 
bution as U(R,R3)U(R, Rs) U(R,4), which is a different partition with the same 
set of m;. For each possible set of m, there are several terms, all of which are 
equal. To see how many such terms there are, note that there are NV! permuta- 
tions of the N coordinates, and each such permutation corresponds to a term 
in Som) With the same set of m,. But we are counting each term more than once. 
We should not count different permutations of the coordinates within a given U, 
for example, U(R,, R,) = U(R2, R,). We also do not count changes of order 
of the U’s as separate contributions. For example, U(R,, R,)U(R3, R4)U(Rs) = 
U(R3, R4)U(R,, R2)U(R5) are not counted separately. Thus for a given set of 
m, there are N!/T], (/!)"m,! terms. So we can write 
-24_! (VE B)™ ny 


N! N! att possible sets f dy™'m,! 


of m;: Lim, = N 


=) fie. (4.56) 


{mj} I m,! 


Xw 


It is important to note that (for large volume), 5, is independent of volume. 
That this is so can be seen as follows: U,(R') vanishes unless the particles are 
clustered. Holding dR, fixed and varying dR, ..., dR, then gives some number 
completely independent of volume, because unless R,,...,R,; are near Rj. 
U(R) = 0. 

The integral over dR, then gives a quantity proportional to the volume, V. 
and as b, was defined with a V, in the denominator, b, is independent of V. 

We must now evaluate Xy = Som) [Ii (Vb)™/m,! subject to ©. ml = N. 


43 Mayer cluster expansion 109 


Recalling a similar summation encountered earlier (for the quantum 
statistics of a many-particle system), we note that if the restriction }' ml = N 
could be removed, Xy can be easily evaluated. Using the same methods as before 


we define 
3N/2 
e” Fx = me us Zn 
N!\2nh?B 
and 
ee =, Se PN ea (4.57) 
N N 
where 


es(2 i eB, 
2nh?p 


But first taking the sum 3°.) with >) m,/ = N and then summing over all N is 
clearly equivalent to summing over all m, with no restriction. 


_ Vb Tm tpybmr eed 
ews) TI As ae (no restrictions) 
mi ol m,! 


(Vby)™a™ (Vbq)™a2"> 
my,m2,""° m,! m,! 


exp lv > “| 
t 


g = —kTV ¥ a'b,. (4.58) 
~ | 


This is known as Mayer’s cluster expansion. 
From Eq. (1.51) 


3 aoe 
(NP = 2 


+kTVY¥. Ia! 1 of b; = V Y lo'b,. 
T Ou T 


il 


Therefore the equation 
p =<N>/V = ¥ letb, (4.59) 
7 


determines « if we know the density. 
From Eq. (1.53) the pressure is given by 
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Therefore we get the equation of state 


PV — PV _ Sb,’ 
RT NkT Y, Ibe!’ 


For small density, « is small—as can be seen from Eq. (4.59). Then a't! < 
a’; so that expansion (Eq. (4.60)) need only be carried out a few terms. For 
small /, b, can be calculated easily. Inverting Eq. (4.59) to get « as a power 
series in 1/V, and writing 


(4.60) 


RT V V 
we have 
B= ae N= aie a e FB(r12)] dR, (4.61) 
C = 4b; — 2b,b3 = 4B? ae A 2 = e PV (r12) pas, e BV (r23) e@ FV (ri3) 
b r 
+ hala) dR3. (4.62) 


When the density becomes large, the Mayer cluster expansion is of little use. 
This is because at very high density (liquid), the most important term in the 
expansion becomes the Nth term. That is, almost every particle becomes a 
member of a very large cluster— and in a liquid all the particles are in an N-fold 
cluster. 


Hard Sphere Gas 


The question may be asked: What is the partition function of a hard-sphere 
gas—that is, a gas composed of small inpenetrable spheres that exert no forces 
on one another when not touching? Unfortunately, this question is still un- 
answered. The question may be stated in a different way. 


Zy = 1 dR. 
no two closer than 2a. 
a=radius of sphere 


But 


VN = ANR. 
no restriction 


Thus { d*R/f d¥R is essentially the probability that if N points are “dropped” 
into a volume V, no two particles will be closer than 2a. 
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44 RADIAL DISTRIBUTION FUNCTION 


Consider a system of N particles interacting with pair-wise forces. The partition 
function Zy is 


Zy = | exp |- BY. ’] d?"R, (4.63) 


pair 


The probability density for particle | being at R,, particle 2 at R,, etc., is 


1 
i) exp [-BV(R,,-.., RW) 
Zy 

where the factor of 1/Zy provides normalization. The probability that a given 
particle, say particle 1, is at Ry and the rest of the particles are anywhere is 


2) [oe [—BV(R,,..., Ry)] d°R, d°R3,..., d°Ry. 
N 

Because there are N particles, the probability density for one of these particles 
being at R is 


N 
n,(R) = Z. [ee [-BV(R,, R),. een Ry)] d*R, re -d3Ry 


N 
= 7 fe |-6 > Very) dR aR: (4.64) 
Zy pair 
n, is the one-particle density, or distribution function. 
The two-particle distribution function 7,(R,, R,) is defined as 


n(R,, R>) = a [ew |-6 y v| d?R,---d?Ry; (4.65) 


pair 


h, is the probability density for finding a particle at R, and another particle at 
R,. Fora liquid or a gas, n,(R,, R>) is a function of the distance between the two 
Particles. 


n,(Ri, R2) = n2(r;2). 


The general shape of n,(r) is as shown in Fig. 4.5. When r - ©, n2(r) 
approaches n? where n, is the one-particle density; this is because when the 
distance becomes large, the existence of one atom does not influence the distribu- 
tion of the other atom. 

_ The radial distribution function n,(r) is related to x-ray or neutron diffrac- 
tion as shown in Fig. 4.6. The x-ray is coming in along the z direction and is 
Xattered by atoms. The scattered x-ray is observed at P somewhere far from the 
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no{r) 


7 


Fig. 4.5 Two-particle distribution function. 


z 
Incoming x-ray 
wave vector K;, R; 


a unit vector 


Outgoing wave 
wave vector Kout= * ip 


io) 


C, reference center 
somewhere in the liquid. 


We can define Z=0 at C. P 


Fig. 4.6 X-ray diffractions. 


liquid. The amplitude observed at P is the sum of amplitudes scattered by each 
atom, the latter being of the form 


a(0)(e*®/R). 


Here R is the distance from the scattering atom and a(@) indicates the possible 
angular dependence of scattering, which is not important here. Taking into 
account the phase of the incoming wave, e’*’2, we write 


Amplitude = a(6) , (el*RsreikZs/R 5), (4.66) 


P is the point of observation. Then, approximately, 


Rjp = Rep = R; + ip 


and we may write 
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So Eq. (4.66) becomes 


eikRcp ; ; 
Amplitude = a(@) Ye Kou Ripikin Ry 
Rep 7F 
or 
. eikRcr 
Amplitude = a(@) -G, (4.67) 
with 
G= y exp [i(Ki,,°R; — Ko * Rj)] = 3 exp (iq: Rj), 
J j 
where 


q= Ki, i, Kou 
The intensity of the x rays is 
Intensity = |Amplitude|? = (\a(6)|?/R2,)|G|?. (4.68) 


To compute the intensity, we need to know the expectation value of |G|?.. Now 


N 
|G|? = ~ eit (Ri- Ry) 
ij=1 


For a liquid, neglecting the i = j terms in the double sum, we have 


<IG?> = ore 1) ates i [-B ent (EM) d?%R (4.69) 
N 


because the term in brackets, [ ], is the probability for the configuration R,, 
R,,..., Ry. Now, using Eq. (4.65), we write 


(IGP) =4 | ci Pn (Ry, R,) dR, dR, (4.70) 


When 7,(R,, R,) depends on |R, ~ R,| only, V = the volume of the liquid, 
and g = ||, 


<6?) = 4V | Xn xX) dex = 2 | mn) singrdr. (4.71) 
q 
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In the classical case, if we know n,(r), we can derive all the thermodynamic 
functions. 
The internal energy is 
1 6Zy 


U = 38RT —- — 4.72 
: Zy OB OH) 
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where Zy is the partition function defined in Eq. (4.2), now 
1 dZy ete le exp[—B > V] d?*R 


Zy OB Zy 
_ | Vr23) exp [-B> V] dR: -- 
Zn 
=~ N(N — I) [row exp [-B> V] dNR. 
2 Zy 


Here, we use Eq. (4.65) to arrive at 


1 éZ 1 
Za OB a “3 | V(ry2)M2(r12) d°R, d?R, 
or 
sae = RT ny “ V(r)n,(r)4nr? dr. (4.73) 


This equation could have been expected. The first term of Eq. (4.73) is the 
kinetic energy, and the second term is the potential energy. The latter comes from 
the pair potential V(r), and the factor 1/2 appears because each pair is counted 
twice. 

The pressure is expressed as follows: 


pV = RT —- x R- VV(R)n(R) d?R (4.74a) 
or 
Vol ‘ 2 
pV = RT —- ei rV'(r)n.(r)4ar° dr. (4.74b) 


These are classical formulae, which do not hold for liquid helium. Equation 
(4.74b) is derived as follows. From Eq. (2.121) 


a, 


As in Eq. (4.73), (KE) = (3/2)RT. The second term of Eq. (4.75) can be 
written 


(z rid) = Ma <r "(r12)> 


aN Dye ornare) 
2Zy 


= 4 [reveameria d>(R, — R). 
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Equation (4.74b) follows. Equation (4.74a) is nothing but the obvious 
neralization to the case of potentials that are not spherically symmetric. The 
product R-V V(R) in the integrand of Eq. (4.74a) is called the virial. 
When the force is homogenuous of degree (n + 1), 
V(r) = —-cr", 
so that 
rV'(r) = nCr" = nV(r) 
‘we can use these in Eqs. (4.73) and (4.74b) to see that 
3pV + nU = (3 + 3n)RT. (4.76) 
This relation is called the virial theorem. 
Problem: Show that at low densities 
nr) = 1 — e PYM), 


Using this result in Eq. (4.74), derive the virial coefficients in the virial expansion. 


4.6 THE BORN-GREEN EQUATION FOR 1, 


In Section 4.5 we pointed out that one can find all thermodynamic functions 
given n,(r). In Section 4.1 of this chapter we showed how we could find (in 
principle) e~ *¥ given the potential between pairs of particles, and from F we can 
also get all thermodynamic functions. But to get e~** we must integrate over 
something like 1073 variables. Computers cannot do such an integration; so we 
have got to find some way of getting the answer approximately. The Mayer 
cluster expansion was one way, and it was satisfactory for a gas. So far, for a 
liquid, we have replaced F by n,(r), which requires six fewer than the 107? 
integrations for F. Clearly, we still have quite a way to go. One approach 
toward approximately evaluating our monster integral is by means of the Born- 
Green equation. 


n,(R,, R,), defined in Eq. (4.65) can be transformed as follows: 


n(R,, Ry) = ee E y v| d?R,---d?Ry 
N 


pair 
= we few -8 xy VR) - BY ve) | 
N i#xl 14 


x dR, ais d>Ry.’ 
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Differentiate the above equation to obtain 


Ving R= + X=) [ve 7) vd] 
i#z2 


Zy 
x exp (-6 = V) aR d Ry (4.77) 
pair 
When we define 
N! 
n,(R,, R,, R3) = ————-—— _| exp] —- V 1 d°R,- ++ d?Ry, 4.78 
3(R,, Ro, Rs) TEER P| BY | 4 N ( ) 


we can write Eq. (4.77) as 


V,n(R,, R) = — BV, V(R,2)n{R;, R,) 
ay [r7ess Ry, Rs) @Ry. (4.79) 


This is the Born—Green equation.* 


The special feature of this equation is that both n,(R,, R,) and n3(R,, R,, R3) 
are involved. When we go further and write the equation for n; we need n,. The 
equation for n, includes n,, and so on. Thus, we have a hierarchy of equations. 
This kind of hierarchy appears in other branches of physics—for example, in 
meson field theory—but no general, accurate way of treating the set of equations 
has been established. When we make a guess for 1,, say, and cut off the hierarchy 
there and calculate back n,, we might get a good estimate of n,; but-that an 
error in the guess of n; becomes unimportant as we approach n, has not been 
proved. 

An ingenious way of closing the hierarchy is due to Kirkwood. When R; Is 
very far from R, and R,, 


n3(R,, R2, Rs) > n(Ry, Ron. 
Considerations such as this led Kirkwood to the approximation 


n,(R,R,)n2(R,R3)n(R3R,) (4 803 
3 , 
ny 


n3(R,, R2, R3) = 


which is called the Kirkwood approximation,{ or the superposition approximation. 
When we use Eq. (4.80) in Eq. (4.79) we can solve for n3. 


* Born M. and Green H. S., Proc. Roy. Soc. (London) A188, 10 (1946). See also 
J. Yvon, Actualities Scientifique et Industrielles (Herman and Cie, Paris 1935), No. 203. 


+ Kirkwood J. G. and Boggs E. M., J. Chem. Phys., 10, 394 (1942). 
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4.7 ONE-DIMENSIONAL GAS 


Our attempts to evaluate the integrals involved in the thermodynamics of liquids 
and gases have not met with complete success. The Kirkwood approximation 
:might not be bad, but we do not know exactly how to generalize it, or whether a 
generalization would give better results. What we want is a method that would 
give progressively more accurate results as we increased the computational labor. 
No such method is available today. 

We are faced with a problem that is too hard for us. We may be able to gain 
some insight by picking a soluble problem that has some of the features of the 
insoluble one. 

For example, consider gas particles constrained to move on a line (Fig. 4.7). 
First we assume only nearest-neighbor interaction. The partition function is 


ZL) = en Pra) +VRI= A) Vd dary s+ dxyy (4.81) 
O<xp<x2<+°+<xN<L 


The length L is fixed. 

We would be making a good start toward solving our problem if we could 
get a recursion relation with Zy,, expressed in terms of Zy. Since we cannot 
even do this, we will modify the problem in such a way that the macroscopic 
properties of the system are not changed but a recursion relation for the partition 
function can be found. We put one extra atom before x, and fix its position to be 
at y (see Fig. 4.8.) The resulting partition function we call Py(y, L). 


P,(y, L) = e PIMiy a1) 4 Vr a2d4 °° -Tdy dxy+++dxy. (4.82) 


y<xy<x2<°°° <b 


In Eq. (4.82) y is fixed. Now, consider the case shown in Fig. 4.9. From Eq. 
(4.82) we can write 


Py.4(z, L) = | e AME FVOmm4t Tdy dx, +++ dxy. (4.83) 
z<y<xzy<re <b 
a xy Xg x3 x4 xy L 
« ~~— a 


Fig. 4.7 Gas particles constrained to move in one dimension only. 


Fig. 4.8 One atom fixed at y. 
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0 Zz y xy x9 xXy OL 


Fig. 4.9 One atom fixed at z. 


In the integral, z is fixed, but y varies. From Eq. (4.83) 


Py+,(z, L) = eee dy 


z<y 
~AIV y= x1) FV rmx) Ty 
x e aed is dx, ++: dxy 
y<xy<e-- <b 


Py(y, Lie 8”@-™ dy. (4.84) 


z<y<L 


Using Py(y, L), we can construct the grand partition function 


N,— BF. nl ny 
e789 — ye e AFn — ¥ ef aie Zn 
N 


N N! \2nh?B 


In our case, Zy/N! = Py(y, L) (the factor 1/N! comes from the fact that we have 
in the integral for Py restricted the range to y < x, <-++: < L instead of 
allowing each particle to be anywhere in [0, L]). We can absorb the factor 
(m/2nh? By’? in the ef" by adjusting p. Then 


ef9 — ¥ oF NPA(y, L). (4.85a) 
N 


As in Section 1.1, g for a combination of several independent subsystems. 
each with the same p, is the sum of the g; for each subsystem. In Section 1.6 we 
defined F = g + pN, where Nis the expected number of particles in the system. 
Clearly, F for a combination of independent subsystems is also a sum of the FS 
for each system. 

If there is a sufficiently large number of particles, we can consider the line to 
be a combination of line segments that interact with each other very little. With 
such considerations, it is easy to see that F/V is a function only of the density 
(at a fixed temperature), and F can be written F = Vf(V/N). It is not hard te 
show that F of the above form is equivalent to the equation g = —pV. The 
proof is as follows: From Section 1.6: 


_ OF _ OF 


Banh 8 avy 
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Using F = Vf(V/N) we then have 
g+ pV =F— UN + pV 


I 
hey 
I 
2 
| 
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Because the ‘‘volume”’ of the system is L — y, we can now write Eq. (4.85a) as 


efP(L-y) _ ¥ tN. (y, L), (4.85b) 


N 


Multiply Eq. (4.84) by e*! and sum over N to obtain 


L 
eP(L-2)8 aa eP(L—y)B - BV(z—y) dy, 
Zz 
which leads to 
L 
eT FH — | e7 PO -2)B 9 BV(z—y) dy, 
4 


or 


ao 
en bu = 7 PPXe- BV(X) dy. 
0 


(4.86) 


(4.87) 


The integral is insensitive to the upper limit, so we have replaced that limit with 
0. This is the desired result from which we now can find all the thermodynamic 


quantities. For example 


Ou Ou Ou Ou 


Equation (4.87) can be written‘as 


0= Bu + log fe er Rite IYO) dx: = flu, B, PCL, B)]. 


¢) 


From 0 = df/du, it follows that 
ép =. of [ep =. ts e PPX BY(x) 


au af [ep (2 xe7 PPX BV)" 
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Problem: When the nearest- and next-nearest-neighbor interactions are in- 
cluded, derive the chemical potential. (Hold two atoms fixed near the left end 
of the line and derive the recursion formula. An integral equation results.) 

We see now that we can solve the problem of particles fixed on a line if the 
potential acts only between the nearest few neighbors. But what if all the atoms 
in the system interact? For a certain special form of the potential we can still 
solve the problem. Section 4.8 discusses the case of an e~ !*! potential. 


4.8 ONE-DIMENSIONAL GAS WITH POTENTIAL OF THE FORM e7!*! 


We will now discuss the problem of a one-dimensional gas, all particles of which 
interact repulsively with all others with a potential V(x) = e7 1, 

When there is attraction and no repulsion, the preferred configuration 
occurs when the particles all sit on top of one another. The potential V is pro- 
portional to the number of pairs, N(N — 1)/2. The more pairs there are, the 
lower the potential can get, and ultimately all particles collapse to a point. This 
result does not tell us much, so we assume a repulsive force. 

Assume N particles are distributed in a line of length L (Fig. 4.10). We wish 
to evaluate 


los) [fffor romeo 
0 


| exp |-6 by ae) dx, ++: dxy 


e7 bFn 


I 
nL 
y 
=" 
SE 
z 
re 


O<xy<+-+<xy<L = 
m N/2 
“Gay” felon tse 
T 
O<xp<+-+-<L 


+ } ae dx, +* dxy. 
j>iz2 
(4.87) 


Fig. 4.10 One-dimensional gas. 
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Suppose, as before, we modify the problem slightly by placing an extra 
atom before x,, and fix the position of that atom at y. Let a = e”. Later we will 
set y equal to zero. The partition function for this system is 


Cae | exp [ae $e? poet oH 


O<y<xy<---<L 


+ YX see) dx, oes dX>. (4.88) 


j>iz1 


Then 


Iy+ (a, L) 


\ 


exp \-p jae se Pres? ae 8) 


O<y<xgp<xy<-++ <b 


+ = aa dxo ees dxy 


j>iz0 


Ml 


exp (— Bae *°) 


O<y<xg<-+-<L 


x exp {pl la emerson emmy 
+ ~ al dx g*** dxy 


Jj>iz1 


L 
[exp ae)Inle + €°, L ~ 9) do (4.89) 
y 

As in Section 4.7, we can again modify the problem in a manner that does not 
affect the macroscopic properties. We allow N to vary, put in a chemical 
potential, and define 


en =P Iya, L)eh®, 
N 


Because I,(a, L) is the partition function for a system with one particle fixed 
at y = log a, its length is L — log a, andg = —p(L — log a). 

Muitiply Eq. (4.89) by e%*!"*, sum, set y = 0 and replace L by oo. The 
result after a little algebra is 


eTHe W [ exp {—f[e~*° + p log (1 + e*)]} dxo. (4.90) 
0 


From Eq. (4.90) we can obtain all the thermodynamic properties, and we can 
reasonably assume that the atom fixed at zero does not change those properties. 
But let us see what we can do without fixing an atom at zero. The force on the 
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first atom, the one located at x,, is due to the rest of the atoms in the gas, and 
the effect of the other atoms can be described with a potential involving one 
extra parameter. Thus the first atom moves in a potential ce*! — 1, where 
c = ¥,e"*. Instead of specifying the position of one of the atoms as an extra 
parameter in the problem, we will specify c, which determines the potential felt 
by the atom at x,. We now calculate G,(c), which is the partition integral: 

L 


[or |- y, ee} dx, +++ dxy 
i<j 
with the range of integration subject to the restrictions that 


ye Sc 
i 


and 0 < x,°:*< xy < L. Mathematically, 


Gy(c) = | exp |-6 2 ial r) (< -> «*) dx,-++ dxy. 
O<xyp<-++<xn<L : 


Note that G,(c) = Oif¢ < 0, and 
L 


N/2 foo) 
e FFs = ( Gy(c) de. 
2nh* B o OL 


To get a recursion relation we will hold x, constant until the rest of the integra- 
tions are done. 
Equation (4.91) can be rewritten 


Gy(c) = [oe |-6 > e7 im) _ B y’ tee 
e i 


i<j 
i 


x 6 (< -e — yr e) dx, +++ dxy, (4.92a) 


where 


Because of the action of the 5-function, the }}; e~*' in the exponent of the inte- 
grand can be replaced by c — e-*!. To make the argument clearer substitute 
xX, = X; — X,. Thus 


L L 
Gy(c) = | dx, | 6 (« —e tet yy e*) 
L F . r 


x exp |-" bi Be y oe dx,--+dxy. — (4.92b) 


t 
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We know that 


5 (< Ree e e*) = e'6 (< eat aa “~) 
because O(ax) = (1/a) 6(x). Equation (4.92b) can be rewritten 
L L-x, ‘ 
Gy(c) = | dx, exp (x,) exp [—(ce* ~ 1)] | 5 ee See | 
L 0 


0 i 


x exp E y uae dx, +++ dxy 


= ( dx, exp (x,) exp [—P(ce* — 1)]Gy_ (ce! — 1). (4.93) 
0 L~-x1 


In general, the solution comes from Eq. (4.93), but it is very difficult to find. 
For subsequent use, let us define (for N particles) 


YX aGa(c) = ePlg(c) (4.94) 
a= eFu 
and 
p = pressure. 


Combining Eqs. (4.93) and (4.94), we find that 


g(c) = | dx exp (x) exp [—P(ce* — 1)]e7®?*g(ce* — 1), (4.95) 
0 

where the substitutions x = x, and extension of the limit L > oo have been 

made. In obtaining the eigenvalues of this integral equation, a relationship will 

be found between « and p; g(c) itself is not of interest. The boundary value on 

Hc) is g(c) = Oif c < 0. To put Eq. (4.95) in terms of differentials, substitute 


h(c) = e7*g(c) 


g(c) = “| dxe*e~ ®?*h(ce* — 1) 


g(c) = «| dxe**e *?*h'(ce* — 1) 
0 
= | e*e” Ax a [h(ce* — 1)]. 
9) dx 


Cc 
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Integrate by parts: 


g'(c) 7 h(ce* — le*e 5P#|2 = a) h(ce* — 1 7 Bp)ex BP dx 
c c Jo 


= OP) ge) - tHe = 1) 
c 


_ hp =") 1) aS : e7 Me“ Doe — 4), (4.96) 


Equation (4.96) appears at first glance to be a differential equation, but is not, 
because g(c) and g(c — 1) enter into it. 
For another form, substitute 


g(c) = eF?!?F(c) 
g'(c) = —Bee™P*!*f(c) + eP*7F"(0), 
Then Eq. (4.96) becomes 


ef '(c) — Be*f(c) — a(Bp — If(c) = —ae*f(c — 1). 


We can use the Laplace transformation 


o(@) = | * en #f(6) de 
0 


to obtain an equation in g(q). Thus: 


\, e *cf'(c)de = — LD ie e *f'(c) dc 
0 aq Jo 


i 


d 
ai [ae(q)] 


and 
ix “*4f(¢) de = - oa). 
0 
Therefore 
Eee 4, aoa + pe 200 + a(Bp — 1)p(q) = ae*/?e~40(q). 
ee 


o(q) = e 7 /*4Y(q), 


we eliminate the first derivative term: 


— BY") = |- v see * peal pp 1) +] w. (4.97) 


49 Brief discussion of condensation 125 


If instead we use Fourier transforms 


o(o) = | e*f(¢) de 


and substitute 
Ww) = e °!*Feo(a), 
we get 


Bw'(@) = (F + aeFl2el? 4 a(Bp — 1) + ») W(o). (4.98) 


The boundary conditions are not known definitely in this transform notation. 
However, it may be that the only restriction necessary is that the function 
¥(q) Or p(@) remain finite everywhere. We can now, in principle, get a functional 
relationship between « and p. 


49 BRIEF DISCUSSION OF CONDENSATION 


For a one-dimensional gas such as the one we have just considered with repulsive 
forces there is no condensation. Let us consider instead the case where there are 
both repulsion and attraction, such as the one depicted by the potential of Fig. 
4.11. At T = 0° all atoms are adjacent to each other in positions corresponding 


to Viin in Fig. 4.11: 


As the temperature is increased there is a continuous transition from the “solid” 


to an aggregate of groups of atoms: QOOO OQCOO Cocco. The 
energy required for breaking up one group of atoms into two groups is roughly 
the energy required to break two atoms apart and we call it «. The probability 
of such a break is e” */*7, and when the temperature is raised it happens more or 


less uniformly. In three dimensions one might have a group of N atoms Sp , 


Vinin 


Fig. 4.11. A potential with attraction and repulsion. 
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The energy required to cut across a group ise = N?/3 A, where A is roughly the 
value of the energy required for taking one atom away from the group. Thus, 
the probability for splitting up the group is e~4’°4/K7, The larger N is, the 
higher T is required to be for a finite probability of breaking up a large group of 
atoms. Thus, in the one-dimensional case, a finite energy is required for breaking 
up a group of atoms, a finite number of groups exist, and the fraction of atoms 
(essentially infinite in number) in the condensed state (that is, in the largest 
group) is zero. In the three-dimensional case, because of the probability factor 
eN?A/KT a finite fraction of the total gas can exist in the condensed state at a 
finite temperature. 

For solids the difference is more subtle. In one dimension, as the temper- 
ature is raised, let the mean square error in the position of one atom relative to 
the ones on each side of it be 6”. Then, over a group of N atoms, the mean 
square error in positions of the first and last atoms is Né? from the perfect 
lattice configuration. The error ‘“‘propagates’’ (in a sense), and when temperature 
is increased, the solid expands monotonically. There is only one path along 
which information can be transmitted. In three dimensions, the position of an 
atom in one place influences that of a distant atom much more strongly than in 
one dimension—there are many paths connecting the two along which forces 
are being transmitted. Thus to melt a solid T must become high enough that 
the thermal energy can overcome these “multipath” forces. 


CHAPTER 5 


ORDER-DISORDER THEORY 


5.1 INTRODUCTION 


We will now consider a new class of statistical-mechanical problems, problems 
that do not involve motions of atoms but instead involve their positions in a 
lattice. More precisely, we will consider alloys made up of two types of atoms. 
For instance, in the cubic lattice shown in Fig. 5.1, there will be some preferred 
arrangement for the manner in which the lattice sites are occupied by type A 
and type B atoms. Let us take the case of zero temperature. Essentially, two 
types of lattice arrangements are possible, one of which is depicted in the figure, 
and they can be understood from a physical point of view as follows. Let V4z be 
the interaction potential between unlike atoms and V,,, Vg, the potential 
between the like atoms. If V4, > (V44 + Vog)/2, then A atoms prefer to be near 
A atoms and likewise for the B atoms. The solid would tend to exist in two 
distinct parts—one of A atoms only, and one of B atoms only. If Vag < 
(Vi4 + Vgp)/2, the A and B atoms would tend to alternate in position throughout 
the solid. If the solid is heated, then gradually A and B atoms will exchange 
positions in a random way until, at the Curie temperature, the order ‘“‘melts”’ ; at 
still higher temperatures, disorder sets in. It is of interest to compute the parti- 
tion function for such a system: 


—BF 
er a ye exp (- >, Br). 
arrangements (i,j,bonds) 


of atoms 


9-0-9 
ae 
O-O-O 


| ! ! 
' | | 


Fig. 5.1 Cubic lattice with two types of atoms, A and B. 
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Define: 
€é; = +1 for A atom 
= —1 for B atom 
N,, Nz = number of A, B atoms, respectively. 
Thus, 


ee; = +1 if AA pair 
+1 if BB pair 
= —1 if AB pair. 


Considering nearest-neighbor interactions only and the total N, + Nz as 
constant, we will write the interaction energy in the form 


Viz = aee; + a) +c 


Then, 
Vag =atbt+e 
Vap =A—b+e 
Vap = —Q + 
and 
2a = AAS Vep _ Vez 


Thus, the exponential in the partition-function equation above is 
exp (-pe » cei) exp (-# X a exp (— Bc). 
ij ij 


The last exponential, e ®°, is a constant and does not enter into calculations. 

In the sum Syonas (€; + €;)/2, each atom contributes to the sum a certain 
number of times. The number of times an atom contributes is equal to the 
number of bonds it makes, that is, the number of nearest neighbors it has. For 
example, in the case of a two-dimensional square lattice, each atom has four 
nearest neighbors; so 


&; + &; 
n> , = = 2(N, — Nz). 


In general, if VN, and Nz are kept constant, we can ignore }° (¢; + &,)/2 because 
it does not depend on the arrangement of the atoms. The problem then is to 
calculate for the partition function the quantity 


exp (~Ba ¥ ee) ~ Yew (—1 Z8): 
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But when we sum quantities over all possible arrangements of the atoms, the 
restriction that N, — Ng be constant may cause difficulties. To relax this 
restriction, we put in the usual “chemical potential.” In other words, we change 
the problem (but not the results of the solution of the problem) by saying that 
N, — Nz can vary. Associated with variation of N, — Nz is a proportionate 
amount of chemical potential, which has the effect of making a certain ratio 
N,/Nz the most probable. In such a case, we cannot ignore 


BT a) Sg. 


bonds 2 atoms 


Our problem reduces to the calculation of 


Lew (-# & e6,-J YD “) 


onds atoms 


where, in summing over possible arrangements, we still take N, + Ng = N 
fixed but no longer fix N, and Nz, separately. If J = 0, we are obviously 
solving the problem of (N,) = <N,). For practice, we will evaluate the above 
sum for the one-dimensional case. Then we will consider an approximate 
method that works for two and three dimensions, and finally we will find the 
sum exactly for two dimensions. The exact solution for three dimensions has not 
yet been found. 

It should be noted that the above model can be taken as a model of ferro- 
Magnetism. We interpret ¢, = +1asspinupande, = —1 as spindown, where 
the atoms are taken to have half-integer spin. A better model for ferromagnetism 
is to take the 


Hamiltonian = H Y' S,-S; 
bonds 


where S;, is the spin operator for the ith atom. This model is not equivalent to 
the model with ¢,¢ ; terms. For example, a two-atom system has 


S,°S, = 3{(S, + S,)? - 3h"), 


which has the value $4? with a priori probability three times as large as that for 
which the value of S,-S, has value —3h?. That is, there are three ways in 
which two atoms of spin 4 can combine to give a system with total spin 1, but 
Only one way in which they can combine to give spin 0. On the other hand, 
&€;, takes on its two possible values with equal a priori probability. 

Although the two models are different, the qualitative results might be 
Similar. For the rest of this chapter we will discuss the model in which the 
energy has terms ¢,e;. This is the Ising model. 
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5.2 ORDER-DISORDER IN ONE-DIMENSION 


For practice, let us evaluate 


e FF = Y exp(-H Vi ege4., —-J¥ 8) 


in one dimension. 
First, suppose J = 0. Then an easy way of doing the problem is to define a 
new variable: 


m=tl 
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(2 cosh H). 


So F = —N/f In (2 cosh H), and we have completed the calculation. 

If J 4 0, the above method does not work. We have to use what is, by 
now, a familiar trick. That is, we make up a new problem that, for large N, is 
almost the same as the old one, but for which a recursion relation can be found. 
We will try to evaluate the partition function for a system in which the particle 
on the end has a fixed s. 

Let 

Xy(én41) = Y oexp[—H(eyeg + 7+ + bytn41) 
gies 
— Je, + &2 a i Ey+y)I- 
Then 


Xy Alen 2) = Di X yey 41) exp [~ Hey 4 18v+2 — Jévs2l- 


The free energy for a large number of atoms should be proportional to the num- 
ber of atoms; therefore 


XwyailEws2) = P*X (En +2)s 


where j: is independent of N for large N. The free energy is then F = ~pN. The 
recursion relation then becomes two simultaneous linear equations 


MXN) = Yo Xe MO" where oy = +1. 
y=Ht1 
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If we think of (Xy(1), Xy(— 1)) as an eigenvector and e** as an eigenvalue of the 
above equation, then it is easy to see what to do next. To find e** it is necessary 


to solve 
-H-J H+J 
e — et e 
0= det ( we)’ 


el-s ents 


The result is 


ef! = eH eosh J + efs/1 + e074 sinh? J, 


There are two solutions here; which one do we want? If we look back at the 
case of J = 0, it is obvious that e becomes 2 cosh H = e~* + e®, so the 
general result is 


ef = e# cosh J + ef/1 + e7*# sinh? J, 


Problem: Try to get the free energy as a function of the ratio between the 
number of A’s and the number of B’s. 


5.3 APPROXIMATE METHODS FOR TWO DIMENSIONS 


You may have noticed that in Section 5.2, none of the equations were considered 
important enough to number. This is because we were not especially interested 
in the answers, but instead wanted to examine some possible ways of looking at 
problems. Perhaps you will be able to improve upon and use some of the 
methods described here, or maybe you will find these methods useful for solving 
a completely different problem. In this section we will nibble at the edges of 
another problem: that of order—disorder for a two-dimensional square lattice. 
For simplicity, let us take J = 0. 

One approach is to find a recursion relation. To do this, find the partition 
function for the case where all the e’s in the last column are fixed. Suppose 
there are M columns of N particles each, and let Xy(e,, &2,..., &y) be the parti- 
tion function. As in the one-dimensional case, the free energy is proportional 
to the number of atoms. It follows that 


ORG fess By) =X y4y(E1,---> Ey) 
= ¥ exp {—H[e,0, + £50, SRO &yOn |} 
@ OM 
x exp {—H[e,8. + °°* + by En] }Xy(O1,..., Oy). 
(5.1) 


The analogous equation in Section 5.2 required for its solution the diagonaliza- 
tion of a two-by-two matrix. The above equation requires the diagonalization 
of a 2%-by-2" matrix. 
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Given the matrix equation MA = 1A with M hermitian, a well-known way 
for finding the solution with the minimum or maximum 4 is to try to find the 
extremum of A - MA/A - A. This method might be applicable to Eq. (5.1). Let 


Xy(E,,..-, €y) = exp {—4H[eye. + °° + Ey pen] }Z(ey,..-, by). 
Then Eq. (5.1) becomes 


PNZ(e,,...,e) = » 5, oP {—4H[e€,e. + +++ + ey_1éy]} 


x exp {—4H[0,0, +--+ + Oy-1Oy]} 
x exp {H[e,0, + °°: + eyOy}}Z(0,,..., Oy). 


We made this substitution in order to get an eigenvalue equation with a sym- 
metric real matrix, that is, a hermitian matrix. Then, to solve this equation, we 
must find the extremum of //D, where 


T= ¥ Z(0,,..., Oy)Z(e1,..., ev) exp {-$H[eye. + ++ + ey—1ey]} 


6,8 
x exp {—4H[0,0, +--+ + Oy—,Oy]} exp {—H[e,, ae + eyOy]}. 
D=¥ IZe,..., endl. 


Of course, it is not likely that you will be able to get an exact extremum of 
I/D. But it should be possible to get an approximate one by cleverly choosing an 
appropriate function Z(¢,,..., €y). To be more systematic, one might choose 
a class of functions Z,(¢,,..., éy) and maximize I/D with respect to a. e%#* 
should be equal to the maximum possible value of J/D. 


Problem: Find as large a value of J/D as you can. In evaluating certain sums. 
try setting 0, = 0, and ey = &,; it might simplify matters if the two-dimensional 
lattice were wrapped around a cylinder. 

We have been considering interactions such that when the temperature is 
absolute zero, atoms of type A and of type B alternate in the lattice. For the 
case of two or three dimensions, if the temperature is low enough, the crystal has 
long-range order. If we know what type of atoms are found at each lattice 
point in one region of the crystal, we know what type of atoms are likely to be 
found fairly far off. Note that in one dimension one wrong atom at a lattice 
point reverses the expected order of atoms on the other side of the defect. 
Thus, in two or more dimensions, we can speak of a sites and B sites, where 
a sites are those lattice points at which an A is more likely to be found. In one 
dimension we cannot do this. For the case of two dimensions, Bethe used the 
concept of « and f sites to describe order-disorder phenomena approximately.” 


* Bethe, H. ‘Statistical Theory of Superlattices,”’ Proc. Roy. Soc., 150, 552 (1935). 
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Fig. 5.2 Two-dimensional lattice with « sites and 8 sites. 


Following Bethe, we will work with only a few atoms of the lattice and will 
try to describe the effect of the rest of the atoms by one parameter. We will look 
closely at only the atoms pictured within the dotted line (region I) of Fig. 5.2. 

The relative probability of a given distribution of atoms in the entire crystal 
is exp (— H Sgonas €:€;) (We are assuming there are just as many A atoms as 
Batoms). If we consider only the five atoms in region I we have to allow for the 
fact that the four outer f sites are affected by region II. We do this approximately 
by saying that the relative probability of a given distribution of the five atoms is 
e" exp (—H > «,¢;) where n is the number of £ sites in region I that have the 
wrong type of atom (A) on them. The sum is only over the four bonds in 
region I. ¢ can be thought of as 


aa 


__ 4 priori probability that a B site in region I has an A atom 
a priori probability that a B site in region I has a B atom | 


By “‘a priori probability” I mean the probability that would be assumed before 
the thermodynamic effects of the interaction are taken into account. That is, 
even if there were no interactions between the atoms of region I, the interaction 
of the four external atoms of region I with region II would still tend to cause the 
B sites to be occupied by B atoms. Notice that 7 is not the total number of sites 
of region I that have the wrong atom. If the a site in the center has a B atom, we 
do not multiply the relative probability by another ¢ because region II does not 
interact directly with that atom. Note also that the atoms inside region I can 
affect each other by way of region II, so ¢ is only an approximate way of describ- 
ing the effect of region II. 

For a given value of e we can compute w, the relative probability that the a 
site in region I is occupied by a B atom. We can also compute w’, the relative 
Probability that a given f site of region I is occupied by an A atom. We require 
that w = w’, because the probability of a wrong atom occupying a given site is 
independent of the site. The equation w = w’ will allow us to fix e. 
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Some of the results of Bethe’s method will be derived below. 


-¥ (relative probability that the & site in region I is occupied by a B atom 
=e and there are n f sites occupied by A atoms), 


Pike « Lees 4 
(") at (4 — n)! 


ways in which nf sites may be occupied by A atoms. It is easy to see that 


. Sy ge; = 4 — 2h. 


bonds 


There are 


Then 
4 4 4 4 
w= y ( es os eo 4H >) ( ) eer — e7 *H(1 =f ee7t)4 
n=0 \h 


=(e# + ee”), (5.2) 
w is a relative, not an actual, probability. The actual probability that the x site 
has a B atom is w multiplied by the appropriate normalization factor. 


Let r be the relative probability that the « site has an A atom. The equation 
for r is the same as that for w, except Ho ~—H. 


r= (e4 + eeH4, (5.3) 


The normalization factor is 1/(w + r); so the actual probability that the site has 
a B atom is 
(e~# + eel)* 
(e~# + ee)* 4 (et 4 ce BS 


w’ oc q (expected number of f sites of region I with A atoms) 


4 
w' = 4 ) al[(relative probability that n f sites are occupied by A atoms and 


n=0 the « site has a B atom) + (relative probability that n B sites are 
occupied by A atoms and the « site has an A atom)| 


= 4 > n (‘) efe BG72" as geen] 
n 


ns 3 
a ¢” e H(4—2n) + ell(4— 2n) 
¥(,2 et 

= ceh(e# + eel)? + ce Mel + ce HY. (5.4) 
Equating w and w’ from Eqs. (5.2) and (5.4), we find 


(e~# + eeM)* = eeh(e7# 4 ee)? + ce” Me# + ee *)?, 


63 Approximate methods for two dimensions 135 


So 


(eo # + ee#)3(e7# + ce! — gett) = ce H(e# 4+ ce7#)?, 
simplifying our equation for e, we get 


—H H 
Reece 2 ee. (5.5) 


ef 4 ge7# 


Notice that if ¢ satisfies Eq. (5.5), so does I/e. This fact holds because nowhere 
in the mathematics of our derivation did we specify that « sites, rather than B 
sites, are the locations for which A atoms are more probable. 

Solving for e?# in terms of e, we find two solutions: either 


—2/3 


2H _ ef TE — 21/3 -1/3 
Or Cie == ue =e" +e (5.6) 


or else ¢ = 1 and A is arbitrary (the latter solution is most easily seen from 
Eq. (5.5)). 

The minimum value of ¢!/? + ¢~ 1/9 is 2; so if H is too small (high temper- 
ature) we must take « = 1. From the definition of ¢, e = 1 corresponds to 
complete disorder. We can expect discontinuities in certain thermodynamic 
quantities or their derivatives at the temperature for which complete disorder 
sets in. 

As an example of a thermodynamic calculation using the above model for 
order-disorder, let us calculate the energy. 

Suppose a bond is an A—B bond with probability p. Then with probability 
Pa given bond contributes energy — H/f, and with probability (1 — p) the 
bond contributes + H/B. The expected value of the energy is 


E = (number of bonds) E (=) +(1— p) 5") = a (1 — 2p) (5.7) 


Where N is the number of atoms. 


an [ee"(e“# + eel)3] 
w+r 


the probability that a given f site of region I is occupied by an A atom given 
t the « site has a Batom. Except for the normalizing factor 1(w + r), this is 
ne of the terms in w’. 


1—- a [ee "(e# + ce #)3] 
wtr 
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is the probability that a given site of region I is occupied by a B atom, given 
that the « site has an A atom. The probability that a given bond between an , 
and a f site is an A—B bond is therefore 


eeH(e7# + gel#)3 ce H(e® + ee #3 
p= see" + ee) +/1— mC ) F (5.8) 
w+r 


A/B is a constant independent of temperature. Using Eqs. (5.7) and (5.8) we can 
get Cy = CE/CT. We get the entropy from the equation 


T 
S(T) = S(T) — S(0) =| Cyr 


At T = 00, we can compute the entropy exactly from its definition. Recall that 
S = —k>, P, |n P,, where P,, is the probability of state n. There are 2% state, 
of our N-atom system, and each has equal probability at infinite temperature. So 


= N 1 1 = 
S = -k2 (5. In 5, ) = Nk In 2. (5.10) 


Our approximate calculation, using Eq. (5.9) gives S = Nk(0.697) instead of 
NkK(0.693). 


5.4 THE ONSAGER PROBLEM 


Consider a two-dimensional square lattice of N atoms. Each atom is designated 
by +1 or —1. That is, the ith atom carries a number ¢;, which is +1 or —1. The 
energy of interaction of nearest neighbors is + Hee; in units of kT. A state of the 
system is denoted by giving the «; for each atom. The problem is to calculate 
the free energy of the system. 
The partition function 
Q= e BEK 
all states 
Q — eM Heit; = T*e7 Hees, 

all states all states 
where >-* and [[* mean the sum and product on pairs of points i and j that are 
adjacent on the lattice. It is easy to see that if the sign of H is changed. 2 

remains the same. For convenience we will write 
Q= y, T* ete; (5.11) 

all states 


and take A positive. 
Since e¢, = +1, 


e —e — ,teezH 
+ &&; = err 
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is obviously true. So 
e#*#s = cosh H + ee, sinh H 
cosh H(1 + ¢,T), (5.12) 


where T = tanh H. (Note that T does not represent temperature in this section.) 
From Eqs. (5.11) and (5.12) we find 


Q = > JI* [cosh H(1 + ee;T)]. (5.13) 
Remembering that for an N atom lattice there are 2N bonds, we have 


Q=cosh%*H YY M*( + ge;T) = 2%Q' cosh?’ H. (5.14) 


all states 


We must now find the modified partition function 


g=5 ¥ WMG + e¢,7). (5.15) 
2” all states 
Expanding the product we use the notation )}* to indicate a sum over all possible 
sets of / different bonds. Then 
2"Q' = y oe y {1 + TX ee; + T7335 (ee ee; ae }. (5.16) 
e,= +1 tn= +1 
Note that in each term, the e’s occur in pairs corresponding to nearest 
neighbors, and no such pair occurs twice in the same product. 
Associate with each pair e,¢; a bond connecting the ith and jth atoms (i and j 
are nearest neighbors). With each term or product of 2/ «’s, we associate a graph 
or set of / bonds. Because no pair ¢,e, appears twice in a given term, no bond 


appears twice in a given graph. For example, Fig. 5.3 gives the graph associated 
with the term 


(€1€2)(€3€4)(E5&6)(E1€5)(Er&6)(E3E7)(Egeg)(E7Eg)(E1 0&1 1 (E1014): 


1 2 3 4 : 

5 6 7 8 : 
© Sg i. "12 0~«~° 
+ 43" 14 15 *16 


Fig. 5.3. A graph or set of / bonds. 
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Fig. 5.4 Example of a closed graph 


Now, 


e=0 and ye Se (5.17) 
e=t1 e= ti 

From Eggs. (5.16) and (5.17), it is clear that only those terms where each ¢; 

appears an even number of times will contribute to Q’. This is equivalent to 

saying that the only graphs that count (or contribute to Q’) are those in which 

each atom or lattice point has an even number (0, 2, or 4) of bonds emanating 

from it. In other words, the contributing graphs must be superpositions of 

simple closed polygons having no common sides. Such graphs may be called 
“closed graphs.” 

Figure 5.4 gives an example of a closed graph. Note that the graph of Fig. 

5.3 is not closed and does not contribute to Q’. Figure 5.5 is another example of 


. . . 


Fig. 5.5 Example of an ‘“‘Unclosed”’ Graph. 
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a nonclosed graph, which contributes zero. Each term in Eq. (5.16) correspond- 
ing to a closed graph has each ¢; occuring to an even power (0, 2, or 4), and 
therefore equals simply T", where L is the number of bonds. Summing over all 
g, gives a factor 2%, which is cancelled by the 2 on the left side of Eq. (5.16). 
Each closed graph of length L thus contributes a term T “ to O'; so 


Q' = X g(L)T", (5.18) 


where g(L) is the number of closed graphs of length Z that can be drawn on the 
lattice. Notice that g(L) = 0 unless L is even, so the sign of T (and therefore 
the sign of H) is immaterial, as we expected. 


Method of Finding Transition Point 


We now digress for a moment to show a method (due to Kramers and Wannier) 
by which the transition point of the Onsager problem can be found. This method 
is not powerful enough to yield an analytic expression for the partition function. 

Consider a closed graph on the lattice. Label the midpoints of the lattice 
squares i; = +1 if inside the graph and y; = —1 if outside. (See Fig. 5.6.) 

Consider a bond (—-—) between midpoints of unlike values. Each of these 
bonds corresponds to a bond of the closed graph (because each -- bond cuts 
a — bond). 

We now have two lattices. The new lattice is formed by putting ‘‘atoms”’ 
described by p; at the midpoint of the old lattice. For the new lattice we can 
discuss Q at a temperature 7’ and such that the interaction energy is / in units 
atkT’. Then 


Qh) = YL Tikelmns, 


all states 


Fig. 5.6 Midpoints of lattice squares labelled + inside the closed graph and — outside it. 
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Ifp; = +; then py; = +1, and there isa contributione**. Ifu; = —p,. 


then the contribution is e~". 


Now, if in a given state there are Z unlike bonds and 2N — L like bonds. 


then its contribution to Q(A) is 
e7 Lhg(2N~L)h 


so that 
Q(h) = e2Nh y m(L)e~ 244 
L 
where m(L) is the number of ways by which unlike pairs can be arranged on the 
lattice. 
But from the preceding argument (concerning -—— and — bonds), m(L) = 


g(L), where g(L) is defined by Eq. (5.18). 
Thus 


Q(h) = e?™"¥ g(Lye"7™, 
L 
Q(H) = 2" cosh?" HY g(L)T". 
L 


Let us now define h by e~?" = T = tghH. Then: 
Q(h) = tanh” HY g()T' 
i 
and we get the identity 
Q(H) = 2" cosh?% H tanh’ H Q(h). 

Now, when # is large, h is small; if Q(A) has a singular point, so does 
Q(h). That is, if there is a transition at H,,, there is also a transition at ,,. If we 
assume that there is only one transition or Curie point, H,, = A,, and H,, = 
—41n (tanh H,,). This can be solved to give 

H., = } log (1 — V2). 
You might hope that we could use the identity 
Q[x] = 2% cosh?" x tanh® xQ[—4 In tanh x] (5.19) 


to find Q(x) for many values of x given Q(x) for one value of x. Just plug in x on 
the left side of Eq. (5.19) and get Q evaluated at another value, 


x’ = —4£l1ntanh x. 
We can then get Q evaluated at 
i 


x” = —4 In tanh x’. 


Unfortunately, x” = x, so this method is not useful. 
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Continuation of Onsager Problem 

Call A(/) the number of ways that, starting from the atom (origin), we can 
proceed in / steps (/ bonds) through the lattice and return to the origin without 
using the same bond twice. In A(/) we do not count separately the two or more 
different paths we can use to form the same polygon. 

For carrying out the calculation we shall need the sum over all different / 
polygons of the crystal. Our A(/) polygons for atom A in Fig. 5.7, say, will be 
counted again in the A(/) polygons of other atoms, say B in Fig. 5.7. Each 
polygon will occur / times. Therefore, the number of polygons per atom is A(/)/I. 
Let 

a SAO (5.20) 
i=1 1 
We have 
Q=YoDT' = Y T', 
L closed 
diagrams 
but need an expression for Q’ that we can hope to evaluate. We will work up to 
such an expression by finding an approximation to Q’, which we will then correct 
by means of a trick. 

A term from a closed diagram can be thought of as a product of terms made 
up of graphs formed by polygons. The contribution to Q’ from no polygons is 
simply 1. The contribution from graphs consisting of one polygon is Nq, for the 
number of graphs consisting of one polygon of length / is NA(/)/7. The contribu- 
tion from pairs of polygons, one of length / and the other of length /’ is (NA()//) 
(NA(I')/1')T'*" so the contribution from pairs of polygons of any length is about 


Lu (NAD (NAL) pise 
= ar | Gea Be 


. A . . 
J 1=8 


Fig. 5.7 A polygon formed by proceeding through the lattice back to the starting point, 
without using the same bond twice. 
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where the factor of 1/2 is to account for the fact that we count each pair twice, 
So the contribution from pairs is about }N*q?. I say “about” because we have 


counted such graphs as 


even though that pair of squares is not “closed,” since one bond appears twice. 
Furthermore 


is a pair of squares and is also a single polygon of length 8,-so we have counted 
it twice. If we continue to ignore such errors, we get 


Q' = 1+ Nq + 3(Ngy + 41(Ng)? +- +> = eM. (5.21) 


Our next step logically should be to correct q in order to get Q’ right. We 
can correct g by correcting A(/). Let us write down a few of the correction terms. 
Because of the presence of polygon 5.8a 


|_| 
Fig. 5.8a 


in the first-order term in Q, we obtained the forbidden graph of 5.8b 


| Fig. 5.8b 


in second order (i.e., as a product of two polygons). To cancel graph 5.8b, we 
insert graph 5.8b into A(8) with a negative sign. But then in Q’ we seem to get 


terms such as 
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in both second and third order. In second order, there are two terms that form 
a diagram of type 5.8c, and both have a negative sign: 


LJ ae Fig. 5.8¢ 
coo 


that is, there are two ways of writing 5.8c as a product of 5.8a and a 5.8b type 
term. In third order, one of the two second-order terms is canceled by the 
positive product of three type (a) terms. To cancel the other second order term 
of type (c), we must insert (c) with a positive sign into A(12). 

In Eq. (5.21), we counted polygon 5.8d twice; therefore when we get a rule 
for what terms belong in our corrected A(/) and what their signs should be, we 
will have to have an excuse for not counting (d) in the corrected A(/). 


and 


Fig. 5.8d 


Now, let us look at where we stand. In evaluating A(/) we expected trouble 
because of the restriction to certain allowed polygons. In correcting A(/) we 
found that there must be other polygons with not necessarily positive contribu- 
tion, and some of the allowed polygons should not be counted. The natural 
thing to do is assign each polygon an appropriate weight, and not ignore any 
polygon in the sum. Recall that formerly we did not count separately the two 
Or more different paths we can follow when starting our from an atom and 
forming a polygon. Now we will! count all the different paths, but we will choose 
the weight of each path so that our corrected “q”’ will lead to the right value of 
Q' when we use Eq. (5.21). 


A Topological Theorem 
Give a weight « = e'"/* for a left turn and « ! = e7‘*/4 for a right turn, and 


keep track of the left and right turns as a closed path (returning to origin) is 
traversed in the lattice. The topological argument of Kac and Ward is that the 
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closed graphs (or ones that we want to count) will be counted, and the forbidden 
graphs (e.g., Fig. 5.5) will cancel out when different ways of traversing them are 
considered. That this theorem is true for simple cases will now be demonstrated 
by a few examples. The complete proof was given by Sherman* and is rather 
involved. 

Consider the simple closed polygon in Fig. 5.8a. 

Starting at A and returning to A in a clockwise direction we note that 4 
right turns were made giving a total weight of 


a” * = (e7 in/4)4 = —], 
Proceeding in a counterclockwise way gives a weight of 
(a)* — (eit/4)4 = =], 


The total weight is thus —2, and the simple polygon is indeed counted. To get 
the correct weight of 1, we multiply everything by — 1/2. 
Now, consider the polygons in Fig. 5.8e and e’: 


ti} “J 


(e) (e’) 
Fig. 5.8 


In Fig. 5.8(e), the path is traced out in such a way (starting from A) that there 
are 6 left turns and 2 right turns for a total weight of «°(¢~7) = (a+) = —1. 
In Fig. 5.8(e’) the path is traced out in such a way that there are 4 left turns and 
4 right turns for a total weight of a*«~* = 1. The total weight then of the two 
clockwise paths is ~1 + 1 = 0. It is clear that the two counterclockwise paths 
have weight 0 also. 

Similarly you should be able to show that 5.8(b), 5.8(c), and 5.8(d) and all 
simple polygons give the correct contribution (remember that everything !s 
multiplied by — 1/2). 


Method of Calculating partition Function 


Let q denote our corrected value of g. Then 


ry {corrected h(1)} T! 


1=1 1 
and 
ge Lig GES, 25 ¥ {corrected h(J)}T' 
dT 11 


* Sherman, S., J. Math. Phys., 1, 202 (1960); 4, 1213 (1963). 
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which is the sum over all paths leaving and returning to the origin of a complex 
“amplitude” or weight for each path. This amplitude is the product of ampli- 
tudes for each lattice point passed: 


T if one continues forward: 
To. if one turns counterclockwise: a = e'/*; 


Ta if one turns clockwise: a! = e7'*/4; 


0 if one turns 180°. 


In our rule for determining the amplitude for each lattice point passed, we 
have neglected to say how to deal with the first step, whose amplitude depends 
on the last step. In what follows, we will calculate the amplitude to start at the 
origin and arrive back at the origin from below, that is, moving upward. Thus, 
we will choose as our rule for the amplitude of the first step: 


T if one starts upward; 


Ta if one starts to the left; 
Ta~* if one starts to the right; 
0 if one starts downward. 


More generally, we will use the above rule for the first step and work with 
amplitudes for starting at the origin and arriving at any lattice point (x, y). 
We shall have to describe the amplitude by giving four components: U(x, y), 
the amplitude to arrive at x, y moving upward; D(x, y), the amplitude to arrive 
moving downward; R(x, y), the amplitude to arrive from the right; and L(x, y), 
the amplitude to arrive from the left. This, as mentioned before, is because the 
amplitude to make the next step depends on the arrival direction, and we shall 
have to keep track of it. 

Roughly, what we will do is calculate the amplitudes for arriving at (x, y) 
from various directions in exactly # steps, and then sum over n. U,(x, y) is the 
amplitude to arrive at (x, y) in exactly n steps with the last step upward and 
Similarly for D,, R,, and L,. We assume that in the zeroth step we arrive at the 
Origin moving upward. Thus U (x, y) = 5x,05y,9, Do = Ro = Ly = 9. 

In other words, the amplitude to arrive in zero steps is one if we arrive 
Upward at the origin and zero for any other point or any other direction of 
arrival, With these amplitudes for the zeroth step, we can get the amplitudes 
for any number of steps by recursion. We can write for the amplitude to arrive 
Upward at (x, y) in + 1 steps in terms of amplitudes to arrive from (x, y — 1) 
i n steps. 


Ons, y) = TU,(x, 23 1) A TaL,(x, y _ 1) of Ta *R,(x,y = i). 
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If we came to x, y — | going up, we have an additional amplitude factor 7. 
If we came to x, y — | from the left, we need to turn counterclockwise and add 
a factor Ta, and so on. We cannot get to x, y going up if we previously entereq 
x,y — 1 going down, for we do not allow paths that reverse themselve, 
Similar expressions for R,(x, y), L,(x, y), and D,(x, y) are easily written. 

Then we see that our assumption for the amplitude of the zeroth step wil] 
give, for the first step, the correct amplitudes (such as T if one starts upwards), 
When summing over 2, we will eventually have to subtract out the n = 0 term, 
because we do not consider a figure with zero side to be a polygon, even though 
it starts and ends at the origin. 

To simplify the summation over a», we define the transform of U,(x, y) 


U.En= 4 ¥ UC, yee ™. (5.22) 
x=—-0 y=—-o@ 
Then 
2n (2 7 dé dy 
U(x, y) = | | eeu (én) Hea 
o Jo (2x)? 


For example, the transform of U,(x, y — 1) is e~u,(é, n), and the transform of 
U (x, y) is 1. The transformed equation then reads 


Unsi(6,n) = Te"U,(E, 0) + 0-e°"D,(E,n) + Tae™L,(E, 0) 
+ Ta 'e~"R,(E, n). (5.23) 


and similarly for Dy s(6; n), D+ (6, n), and Ry+iC6, n). 
If we let w,, be a four-component symbol (column vector) with components 
(U,, Dy, Ly R,)s We can write the matrix equation 


Wno im = TMC, nls, 0) (5.24) 


where M(E, 7) is the matrix 


| ein 0 ae go te7in 
0 ei” a7 tein aei” P 
M x | 5, - -_ 5.25) 
(¢, n) lave qgeT gis 0 ( 
nn ail 0 ee | 


Wo(E, 7) = (1, 0, 0, 0). 
Wi(6,) = TM 
W. = TMY, = (TM) Wo 


Wi, = (TM) Wo. 
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Then 
y= 2) (TM) "Wo = 1— TM Wo 


js the transform of the amplitude to arrive in any number of steps. The ampli- 
tude to arrive at the origin moving upward is 


. 2n f2n 
+ ¥ U,0,0) = -14 Ho" [ | 20 ind 
n=0 § ; 


1 dé dn 
x 
i_ TM? (2n) 


= aé dy 
ee Cerra 


We could have arrived at the origin moving in any direction, so we must sum 
over all four values of wo, that is Wo = (1, 0,0, 0), wo = (0, 1,0, 0), wo = 
(0, 0, 1, 0), wo = (0, 0, 0, 1). The result is a trace. Thus, 


-27 a -|" Es i feeer -1| dean (5.26) 
i—- TM (Qn? 
2nd dé dy +. T 2 a 4 
ae [ [ (Qn? T+ [ (1 — 1M) ~ ; | i 
oe rere Mereeer 
(Qn)? (1 — 1M) 


y U,(0, 0) 


dé dn _ 
an [, (ony? Tr [log 1 — TM)] 
iN aN i = : log det (1 — TM), (5.27) 


we have used the following theorem: 


Tr log A = log det A. 


To prove this identity, note that 
N 
det e? = lim (det e?/%)*¥ = lim ee (1 + x) 
N>o N>@ N 


1 N 
lim (: + -—Tr 8) = eB. 
N 


N>o@ 


HT 
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Therefore, log det e? = Tr B. Let B = log A. Then Tr log A = log det A. 
M is given by Eg. (5.25), and one obtains 


det [1 — TM] = (7? + 1)? — 2T( — T?)(cos E + cosy). (5.28) 


Summarizing, we see that 
BF/N = —\n Q/N, 


Q = 2%(cosh?% H)Q’, 
Q’ = e% 
q= Aa eS log det (1 ~ TM), 
2Jo Jo (2x)? 
det (1 — TM) = (T? + 1)? — 2T(1 — T?)(cos € + cos n) 
T = tanh H. 


Putting all this together we obtain* 


BF 1 alee aa (oC ; 
— —In2—- a { ae In [cosh? 2H — sinh 2H(cos € + cos n)]. 


(5.29) 


5.55 MISCELLANEOUS COMMENTS 


In the above we sometimes assumed that the lattice was finite (with N points) 
and we sometimes assumed it was infinite (when we neglected the boundary for 
example). Because N was assumed to be very large, it is plausible to suppose 
that our inconsistency made little difference. However, let us modify the above 
work to make it consistent. 

Assume the lattice has the topology of a torus. If we pass through L points 
in one direction (or L’ points in the perpendicular direction) we finish back 
where we started. Formerly, to compute the amplitude to arrive back at the 
origin moving upward, we took the sum 


y U,(0, 0). 


1 


n 


Now, we must take the sum 


fee) eo 
> 2D UL, qv). 
p=—-0 q=-H n=1 
* A slightly different treatment for the Onsager problem was given by N.V. Vdovichenko, 
Soviet Phys. JETP 20, 2, 477 (1965); also see Soviet Phys. JETP 21, 2, 350 (1965). 
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Continuing as before, we get in place of Eq. (5.29), 


BF _ = ee 2 Eos ipLé pigh'n 
N 2 Jo (2n)* 


x In [cosh? 2H — sinh 2H(cos € + cos n)]. 


y gn = y 6 5(¢- 


p=-o@ n=—@ 


BF_ ing § (28) (27) 1 
yoo Ge eles 


x In | cosh? 2H — sinh 2H (cos a" +c os] . (5.30) 


But 


So 


As L and L’ tend to infinity, Eq. (5.30) approaches Eq. (5.29). 

A few comments about the form of Eq. (5.29) are in order. If we use the 
expression 

BF = -log ¥ e 
States 

we can verify that Eq. (5.29) is correct for high temperatures (H — 0) and low 
temperatures (H > 00). Another check of Eq. (5.29) is by way of Eq. (5.19). If 
we consider Q as a function of y = sinh 2H instead of as a function of H, Eq. 
(5.19) can be written in an especially simple form. Because sinh 2H = 2 cosh H 
sinh H = 2 cosh? H tanh H, 


Q(y) = 2" cosh?’ H tanh® HO (|) = sinhY 2H ()) = y"9 (|)  (5.19a) 
y y y 


Equation (5.29) can be written 


2n 
fF = -in2~4iny a [, a Jy + 1 os é + cos). 


(5.29a) 


from which it can be seen that Eq. (5.19a) holds. 

Kac made the conjecture that the solution for the three-dimensional 
Onsager problem (cubic lattice), is of the same form as Eq. (5.29) except that 
cos € + cos 7 is replaced by cos € + cosy + cos ¢ and dé dy/(2n)* by dé dy 
al/(2n)>. However, this conjecture is false. 

Instead let us conjecture that the specific linkage of the Ising lattice will not 
8reatly affect the character of the transition point in the specific heat curve. 
Also, an approximation starting from the transition point should prove very 
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valuable. If a magnetic field is added to the Onsager problem, the problem o; 
evaluating the partition function can again be converted to a path problem. 
which has not been solved. The problem here is to keep track of the area en. 
circled as well as the length of the path. 

We have done a lot of work just to evaluate the expression 


e=t1 


o> > xp (-H x nt) 
bonds 


One reason for going to all this trouble is that maybe you will be able to generaj- 
ize the methods used, or apply them somewhere else. For example, we considered 
a statistical-mechanical problem for which we eventually had to evaluate a sum 
over all paths leaving and returning to the origin, with an amplitude depending 
on the preceding step. This is essentially a Markovian walk, with each step 
depending only on the one previous step. A Markovian-walk problem, however. 
can be handled when each step depends on a specific finite number of preceding 
steps. In a way analogous to the method used above, Q, can be found for the case 
where the amplitude of a step depends on the n steps preceding it. The question 
is: Can one find an interesting problem whose partition function is Q,? 


CHAPTER 6 


CREATION AND ANNIHILATION OPERATORS 


6.1 A SIMPLE MATHEMATICAL PROBLEM 


In this chapter we shall describe an operator formalism that has widespread 
applications in quantum mechanics, notably in dealing with harmonic oscillators 
and in describing many-particle systems. 

We begin by formulating and solving the following simple problem: 
Suppose an operator a satisfies 


(a,2 "ec; (6.1) 


The problem is to find the eigenvalues of the Hermitian operator a*a, and to 
relate the eigenvectors. (Note: a~ denotes the Hermitian conjugate of a, and 
[A, B] is, of course, the commutator AB — BA.) 

We first note that, if |w) is a normalized eigenvector with 


a* ala) = ala), (6.2) 
then 
a = Calatalx)> = llalx) |? > 0. (6.3) 


That is, the eigenvalues are all real and nonnegative. Using the identity 
[AB, C] = A[B, C] + [4, C]B, we observe that 


[a*a, a] = [a*, ala = —a, (6.4) 
(a*a,a*] + a*[a,a*] =a‘; (6.5) 
Or, equivalently, 
(a* aja = a(a*a — 1), (6.4’) 
(a*aja* = a*(ata + 1). (6.5') 


From Eq. (6.4’) we have, for an eigenvector |x), 


(a*a)aja> = alata — 1)|x) = a(x — 1)|a> = (@ — Nala). (6.6) 


Therefore ala> is an eigenvector with eigenvalue a — 1, unless aja) = 0. 
Similarly at |a> is an eigenvector with eigenvalue « + 1, unless a*|x> = 0. The 
Norm of ala) is found from 


lala) |? = <ala* ala) = xXa\2> = 2, 
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or 
lala) || = Va. (6.7) 

Similarly, 
lla* |x» || = al oe Ms (6.8) 


Now, suppose that a"|«> # 0 for all ». Then by repeated application of 
Eq. (6.6), a"|x> is an eigenvector of a* a with eigenvalue « — n. This contradicts 
Eq. (6.3), because « — n < OQ for sufficiently large n. Therefore we must have 


aja) #0 but a®*'|a) =0 (6.9) 


for some nonnegative integer x. 
Let |e — n) = a"|n)/|la"\n) ||, so that |x — n) is a normalized eigenvector 
with eigenvalue a — ». Then from Eqs. (6.7) and (6.9), 


Va ~n= |ala ~ n)| = 0, 


and therefore a = n. This shows that the eigenvalues of a*a must be nonnega- 
tive integers, and that there is a “ground state” |0) such that 


a0) = 0. (6.10) 


By repeatedly applying a* to the ground state we see that (a*)"|0) has the 
eigenvalue n and, because of Eq. (6.8), it is never zero. Thus the eigenvalues of 
a*aare 0, 1, 2, 3,.... 

If |”) is a normalized eigenvector with eigenvalue n, then, from Eq. (6.8), 


In — 1) = (IJV a)alny 
is a normalized eigenvector with eigenvalue n — 1. Also 
a*|n — 1) = (WV aataln) = Vain). 


So applying a* to |n — 1) gives us back |n} (within a factor), rather than some 
other state with eigenvalue x. 

We may then construct the eigenstates of a*a as follows: First we find 4 
state |0) such that 


al0) = 0. (6.11) 


(|0> may be unique; if not, we find other operators commuting with aand a’. 
and classify the |0>’s according to their eigenvalues.) Then we define 


ae 


V2 


lly = at0>; 22> = Latty = Lat; 
a3 
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and in general 


In) = = (a*y"0). (6.12) 
ni 


(Note that we could have included arbitrary phase factors in the definition of 
jn); Our convention here is to make them unity.) With this definition, the |n) 
are orthonormal* and satisfy 


a*|n) = Jn + 1\n + 1) (6.13) 
a\n) = Vn|n — 1) (6.14) 
a*a|n) = n\n). (6.15) 


Equations (6.11) through (6.15) form the answer to the problem posed at the 
beginning of this section. 

The operators a* and a are called “raising” and “lowering” operators, 
respectively, because they raise and lower the eigenvalue of ata. In later 
applications a*a will be interpreted as the observable representing the number 
of particles of a certain kind, in which case a* and a are called “‘creation” and 
“annihilation” (destruction operators, or ‘‘emission” and “absorption” 
operators. Equations (6.13) and (6.14) may be alternatively expressed in terms 
of matrix elements: 


(mia* |n) = Vn + 1 Spats (6.13') 
(mla|ny = V1 Smn—1: (6.14’) 
* For, by (6.12) we have 


(nlm) = (Ola"(a*y"|01 nim). 
From Eq. (6.1) we easily obtain 
[a,(a*)"] = n(aty", 
so that 
Ola"(a*)"|0) = la"~*(a*)"a\0> + (O|na"~*(aty"~*\0) 
= n(la"~'(a*y"~1\0) 
= n(n — 1)---(2 — m + 1)Ola"~"10) 
= HY Om 


and the orthonormality follows. 
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6.2 THE LINEAR HARMONIC OSCILLATOR 


Our first application of the results of Section 6.1 will be to the one-dimensional 
harmonic oscillator, which has a Hamiltonian of the form 


ben (6.16) 


where x and p are the position and momentum operators for the particle and 
satisfy 


[x, p] = ih. (6.17) 


Our task is to find the eigenvalues and eigenstates of H. 
Note that f (ma@/h)x and (1/V ma@h)p are dimensionless. Let us define 


a i ( (t= 7 tin P). oe 


Because x and p are Hermitian it follows that 


+ ple ae 6.19 
. -(/% : i»). ( ) 


From Eg. (6.17) we obtain 
[4,a*] = 1. (6.20) 


Expressing x and p in terms of a and a*, we have 


hoa+a* 


= aie 6.21) 
mo 4/2 ( 
Se at 
p = Vmoh (6.22) 
id 
We get, for the Hamiltonian, 
a ho + +) _ + 1 13 
H =~ (a%a + aa*) = hofa*a + 4). (6.23) 


Thus, the eigenstates of H are those of a*a. Now we can apply the results of 
Section 6.1, obtaining the eigenstates |0>, |1), ]2>,... that satisfy 


Alny = (2 + Hhaln). (6.24) 


The energy levels are thus E, = (n + dha. 


6.2 The linear harmonic oscillator 155 


The eigenstates themselves are given by Egs. (6.11) and (6.12). We can 
easily obtain the wave functions 9,(x) = ¢x|n) as follows: from Eqs. (6.18) 
and (6.11), 


0= alo> = [2° (x ca —— ?) 10). (6.25) 


0= [a (x + a a) <x|0> (6.26) 


(where x is now a number, rather than an operator.) Equation (6.26) is merely 
Eq. (6.11) in coordinate representation, in which it takes the form of a differ- 
ential equation. Solving it, we get 


<x|0) = Ae” (me 2h)x? 


where 4 is a constant. Normalization requires that 
fea) 


<O|x><x]0> dx = lar | e (ma/Adx? Jy 


— 0 


1 = 00> = | 


— oC 


= Al? iE 
mo” 


$0 


Ae ES 7 
th 


The phase @ of A is arbitrary, and we set it equal to zero. Then 
rea 
th 


1/4 
éx|0> = = eo (mel 2i)x?, (6.27) 


So 


We have thus found the wave function for the ground state. For the other 
States we apply a* according to Eq. (1.12): 


<xiny = 4 cxl(aty00. (6.28) 
anil 
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Since 
mo i 
at = {/— (x|{x — — 
cl me ¢ i( r) 
— jm@(,_ fd éxl, 
2h ma dx 
we have 


1 /mo\"? h dV 
<x|n> = Sa (e) (x - on <x|0> 


1/4 n/2 , a 
CGY kayo 
niit mo dx 


The matrix elements of observables between harmonic oscillator states can 
be found without having to express the states in coordinate representation and 
integrating over x. We simply express the observable in terms of the raising and 
lowering operators. An example of this procedure is given in the following 
section.* 


6.3 AN ANHARMONIC OSCILLATOR 
Suppose a system has the Hamiltonian 


2 2 
Ha Py MO 4 xt (6.30) 
2m 2 
Assume that 4 is small enough (« hw) that we can use first-order perturbation 
theory, treating Ax* as a perturbation of the Hamiltonian (Eq. (6.16)). Then 
the perturbed energy levels are 


E, © (n + dho + A,, (6.31) 
where 
A, = <nlAx*|n). (6.32) 
From Eq. (6.21) we have 
h 2 
A, = 4 (s53) <n\(a + a*)*|n>. (6.33) 
2mo 


* Problem: Prove that 


>= "(yon [-le PY) 


where f(a*) is interpreted as >* a,(a*)", when f(x) = 2, @nx". From this formula, find 
a generating function for <x|”). (Hint: Prove first that [a, f(at)] = f’(at).] 
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Expanding (a + a*)* gives us 16 terms, but, thanks to the raising and lowering 


roperties of a* and a, the only terms giving a nonzero expectation value are 
those with two a’s and two at’s: 


<n|\(a + a*)*\|n> 


<n\(a*a*aa + a*aa*a + a*aaa* 

+ aa*a*ta + aa*aa* + aaa*a*)|n) 
nin — 1) + n? + n(n + 1) 
+r(tn+1)4+ 4+ 1? + (n+ Im t+ 2) 
6n? + 6n + 3, 


Il 


1 


where we have used Eqs. (6.13) and (6.14) repeatedly. Therefore 


2 

A, = 34 (sis) (2n? + 2n + 1). 
2mo 

6.4 SYSTEMS OF HARMONIC OSCILLATORS 

Suppose a system has the Hamiltonian 


1 
H=! p+ PD v,00, (6.34) 
mM; ij 


where Q; and P; are canonical coordinates and momenta: 
[Q;, Q;] amt [P;, P;] = 0; [9i, P;] = id j;, (6.35) 


and V;; = V;;. To simplify the presentation a little let us make a change of 
scale, defining 

Yn vm; Q;; pi = Pin m, (6.36) 
and 


eee A (6.37) 


Then g; and p; are also canonical: 


[i Pi] = 1hd,;, (6.38) 
and in terms of them the Hamiltonian is 
H=43) pp + 3¥ Uijaig;. (6.39) 
i i,j 


We shall express H in terms of raising and lowering operators as we did for the 
One-dimensional oscillator. The procedure involves two steps: the first is 
mding a set of normal coordinates 7, with respect to which the potential is in 
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diagonal form, and the second is expressing the coordinates and momenta in 
terms of raising and lowering operators. 
Let the coordinates g; and g, be related by 


= Dy Cudi- (6.40) 


Because (U;;) is assumed to be real and symmetric, the transformation matrix 
C,,; that diagonalizes it is orthogonal: 


by Culp: = Sap; > CuiCaj = 6;;- (6.41) 
The inverse transformation of Eq. (6.40) is then 


= Y Cede: (6.42) 


We further assume that the eigenvalues of (U,;) are all positive, that is, that 
the matrix is positive definite (this ensures that g; = 0 is a point of stable equi- 
librium). Denoting these eigenvalues by w?(w, > 0), we have 


> CyiC Ui = 0758s 
tJ 
and thus 
y Uyaia; = Y ona. (6.43) 
Le) a 


Finally, we define p, in such a way as to preserve the canonical commutation 
relations: 


Dy > py CuPis (6.44) 
[Gas Bg] = ih yp. (6.45) 

The result of our efforts is that 
H= tL + w74%), (6.46) 


which means we have a system of decoupled harmonic oscillators (one for each 
value of «). 
Using the methods of Section 6.2, we form lowering and raising operators 


for each mode: 
= (vou de + 


a, = — = Fs) ; (6.47) 


2 ve, 
az eer = (ve. da - ae — Ps); (6.48) 
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de = Jt (a, + a3), (6.49) 
20, 
B, = -i J ee (a,.= az). (6.50) 
Then, 
[a,, 4s] = [az, ag] = 9, (6.51) 
[a,, ag] = d,p, (6.52) 
H = ¥ ha,(aza, + 4). (6.53) 


The eigenstates of H are described by giving, for each a, the eigenvalue n, of 
aja, Thus, 


H\[nyn2n3°°*> = »} (nz + 4)ho,|nyn2n3°°* >, (7.54) 


[nyngny-*- > = [0 al 1000 --- >, (6.55) 
a J/n,! 


where the ground state |000-: - - ) is defined by 
a,|000--- > = 0 for all «. (6.56) 


Note that the energy of the ground state is ), 4#@,. For a system with 
infinitely many degrees of freedom (which we will consider shortly), this quantity 
will generally be infinite. Because the zero point of energy is a matter of defini- 
tion (only the difference between levels being of physical importance), it is 
convenient to redefine the Hamiltonian of such a system so that the ground- 
State energy is zero. Thus, if we let 


H=4) (pi + wigs — Heo,) (6.57) 


with a corresponding (but more complicated) expression in terms of the original 
Coordinates qg;, then 


H = ¥ ho,a7 a,, (6.58) 
and : 
H|nyn2n3-+: > = ¥.n,he,|nynzn3°** >. (6.59) 


6.5 PHONONS 


The states of the system considered in the preceding section can be given a 
Simple interpretation in terms of ‘“‘noninteracting phonons”. Assume that the 
Hamiltonian is given by Eqs. (6.57) and (6.58), so that the energy of the ground 
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state is zero. The ground state is then called the ““vacuum state’ and represents 
the state of the system in which there are no phonons. If the system is in the 
state |njn.n3---) we say that there are n, phonons of type a(a = 1, 2, 3,...). 
The n, are called ‘occupation numbers.” Note that the energy of this state is 
nha, + n,hw, +--+, so that the energy of a single phonon of type « is ha,, 
and the total energy is the sum of the energies of the individual phonons. In 
other words, the phonons are noninteracting. 


Since 
Hae eee | 
a, [ny . Ng—yNgNg+1 > = n, se I|n,..., Mg-45 (n, + 1), Mgt.’ Ds 
(6.60) 
a,\ny "Ng 1NgNat1 °°" > = Vnjln, sey Ng-1, (n, ro 1), Nair ** »s (6.61) 


we may call ay and a, creation and annihilation operators for phonons of type «. 
The operator for the number of phonons of type « is afa,, and the operator 
for the total number of phonons is 


N = ¥ aj a,. (6.62) 


Let the vacuum state be denoted by |0), and let 
\z) = at|0) = |0,..., 0, 1,0,...), (6.63) 


be the state with n, = l anda, = Ofor a’ # «. A phonon, then, is the system 
that is decribed by the states |) (i.e., the system whose quantum-mechanical 
Hilbert space is spanned by the |a)). If |a> and |B) are (one-) phonon states, 
then 


<alB> = (Ola,az 0) 
= <Ol(az a, + 5,5)|0> 
= Oxp (6.64) 
so that the states ja) are orthonormal. To each normal mode of vibration of the 
original system of harmonic oscillators corresponds a one-phonon state (since 
they are both indexed by a). 


We can also use a similar notation for states of two or more phonons. 
defining 


legen 2 tte ae eas (0). (6.65) 


These states are normalized as they stand if a,,..., a, are distinct; otherwise, 
they have a norm larger than 1. Assume for definiteness that « takes on the 
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yalues 1, 2, 3,.... Then the state with 7, of the «’s equal to 1, 7, «’s equal to 2, 
and so on, is 
Myo. 1, 2y.05 20.) = Cat ™(ady" «= 10) 
—_ 


—_——" 
nt n2 


= Vn! ng!-o* [nny s+) (6.66) 


so that its norm is afnit Nylree. 

When we deal with only a few phonons at a time, it is usually more con- 
venient to use the notation |x,,..., %,)> rather than the occupation-number 
description |n,7,---), especially when x can take on a continuum of values. 


The effects of creation and destruction operators on |a,,..., %,) are 
Ge Wiese Oe = lees Stay (6.67) 
n 
Gleniess te = p> Op AG Gs cat Bias Opes tals (6.68) 
=1 


Equation (6.68) comes from Eq. (6.65) and the relation 


n 
aay, eal |e RAR TOCG bua + Seg 
[a,, ay, a,,) = »» Saya Ga, Fans ay," 
=1 


Note that phonons act like Bose particles (insofar as we can call them 
particles*), as an arbitrary number of them may be in any given state (i.e., 
la, ...,a exists for any number of a’s). Their Bose nature is also reflected in the 
symmetry of the states (e.g., |, B) = |B, «>). In Section 6.7 we shall show how 
the ordinary rules for quantum-mechanically describing systems of many Bose 
particles lead to a set of states and operators with the same form as those 
obtained here, so that the interpretation of the oscillator as a system of many 
Bose particles is correct. 

We conclude this section by considering the qualitative effect of an anhar- 
monic perturbation on the oscillator system. Suppose the perturbation has 
terms of the form 


bh CECT and Vs jij M4 
i,Fk 17k 
In terms of creation and annihilation operators, the cubic terms are of the form 
tatat ptt 
Os g Oi 0 Op Gy 8 Aida, 


which always changes the number of phonons (e.g., the first term creates three 


* The phonons will look more like particles (e.g., carrying momentum and energy) 
when the oscillator system is a field as described in the next section. 
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new phonons). Thus, if we start with a definite number of phonons and let the 
Hamiltonian drive the system forward in time, 


I(r) = e~*1Y(0)> 
= |W(0)> — it |Y(0)) + O(¢7), 


we will soon start finding different numbers of particles. The quartic terms 
similarly change the number of particles, except for terms like 


+ a+ + + 
Ay Ag 4,05, Ay Aga, a5,..., 


which conserve the number of particles but act as a mutual interaction between 
them; that is, the particles are no longer independent. The description of a 
mutual interaction will be considered in more detail later. 


Exercise: Verify that the number-of-phonons operator N, defined by Eq. (6.62), 
commutes with a product of creation and destruction operators if and only if 
the number of a*’s equals the number of a’s in the product. 


6.6 FIELD QUANTIZATION 


A notable example of a system with infinitely many degrees of freedom is a 
field. Examples are the amplitude of sound waves, drumhead vibrations, light, 
and so on. Consider a real scalar* field g(x) whose motiont is described by the 
Lagrangiant 


L(o, @) = 4 | exo 4 fas [avKe — x9ao(e’), (6.69) 


where K(x — x’) = K(x’ — x). The classical equations of motion, found by 
varying (x), are 
0= 6 OL o 6L 
ot de(x) = 6 gp(x) 


= G(x) + [arKe — x')o(x’). (6.70) 


* The following procedure can be generalized for a multicomponent field by putting 
indices on everything. 

+ Classically g(x) depends on #, but (as with g; in Section 7.4) we will not show it 
explicitly. Besides, in the Schrédinger picture, the operator g(x) is time-independent. 
¢ We assume that the system is invariant under translations, so that K is only 4 
function of x — x’. 
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Note how Eqs. (6.69) and (6.70) resemble the corresponding equations for 
the system of harmonic oscillators described in Section 6.4: 


L= tai ae 2 Us ;9:9;, 
i if 
0= 4 + DU; 
J 


Thus, we are justified in treating the field as a system of harmonic oscillators 
(at least formally): y corresponds to the symbol ‘“‘g” and x corresponds to i. 
g(x) can be thought of as a separate coordinate of the system for each x. 
As an example, suppose that 
K(x — x’) = —°°V?53(x — x’). (6.71) 


Then Eq. (6.69) becomes, after a few integrations by parts, 
L=4 | dL Alx)6(x) — eV9lx)- Vols] (6.72) 


and Eq. (6.70) becomes 
fA 
Vig(x) — , x) = 0, (6.73) 


which is the usual wave equation. 
If we assume that g(x) is a coordinate of the system for each x, the conjugate 
momentum to (x) is 


6L , 
II(x) = - —~ = @(x). (6.74) 
G(x) 
The Hamiltonian is then 


H 


i 


[xno —-L 


i 


4 [@exneane) +4 [ars [eRe — x')e(x)p(x’). (6.75) - 


To quantize the system we let g(x) and II(x) be Hermitian operators satisfying 

[y(x), g(x’)] = [TI(x), M(x')] = 0, (6.76) 

[o(x), M(x')] = ihd?(x — x’), (6.77) 

and assume that the Hamiltonian is given by Eq. (6.75), except for a scalar term 
to make the ground-state energy zero. 

We next express everything in terms of ‘normal modes.” The situation 


turns out to be slightly different from that of Section 6.4 because it is con- 
Yenient here to use “‘complex”’ (that is, non-hermitian) normal coordinates. 
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Because the system is translationally invariant, we expect that it might help 
to express the fields in “‘momentum” representation. Therefore we define 


Gk) = [asec (6.78) 

Ti(k) = | ateneye (6.79) 
The inverse transformation is* 

glx) = | a she (6.80) 


with a similar expression for M(x). Since w(x) and TI(x) are Hermitian, we have 
Pk) = @-k); T*(K) = M(-4). (6.81) 
From Eqs. (6.76) and (6.77) we obtain 


[H(K), (kK) = (TK), Tk] = 0, (6.82) 
[o(k), F(k’)] = ih(2n)3d3(k + k’). (6.83) 

Now let 
w(k) = [ateKene (6.84) 


From K(x) = K(—x) = K*(x) it follows that 


w*(k) = w?(k)* = w?(—k). (6.85) 
Rewriting the Hamiltonian of Eq. (6.75), we get 
H =! | Ok [h(—Wfla) + o'HH(~H90)] 
2 | (2x)? 
_![d*k 
2 | (2x)? 
We assume that w*(k) > 0, so that the Hamiltonian is positive definite. [Thus 


@(k) is real, and we take w(k) > 0.] 
In the example described in Eqs. (6.71), (6.72), and (6.73), w(k) = c\k}. 


[Tit (ATI) + w(k)6*(DG(K)]. (6.86) 


* Throughout all of this the following integrals will be useful: 


d?xe tk '* = (22) S7(k), ark ek * = §3(x). 
(27) 
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Next we will define annihilation and creation operators (compare Eqs. (6.47) 
through (6.50)): 


k= —| Volk) dk’ + -—— fick 6.87 
a(k) il * w], (6.87) 
a*(k) = hoo @(—k) — - —- A(- |. (6.88) 
J 2h lee ) 
or 
gk) = |—" _[a(k) + a*(-&)], (6.89) 
2w(k) 
fk) = ~1 fi hoo) a) — a*(—K)], (6.90) 
!{he commutation relations are, from Eqs. (6.82) and (6.83), 
[a(k), a(k’)] = [a*(k), a*(k’)] = 0, (6.91) 
[a(k), a*(k’)] = (2n)353(k — k’). (6.92) 


If we write H in terms of a and a*, making the change of variable k > —k 
when necessary, we obtain 


=; d ok 

(2n)° 

plus a correction term to make the vacuum energy zero. The corrected 
Hamiltonian is evidently 


hatk)[a* (k)a(k) + a(k)a*(k)] 


d°k 
H = | — — hotk)a*(k)a(k). (6.93) 
(2n)° 
[Note that the correction term is the infinite quantit —4$ | @kha(k)s°(0)]. 
Finally, we express the original field variables in terms of the creation and 
destruction operators, using Eqs. (6.80), (6.89), and (6.90): 


p(x) = ° i a2 (i [a(k)ei* ~ +4 at(kje**] (6.94) 


T(x) = ae heh) ¢_ig(aye* 4 tae], (6.98) 
a 


Equations (6.91) through (6.95) are the important results of the quantization 
Procedure. 
The commutation relation of Eq. (6.92) may appear strange, in that 
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[a(k), a*(k)] is infinite (instead of unity), so that the analysis of Section 6,| 
does not apply directly to a(k). Suppose, however, that we use a less singulay 
representation. Choose a complete orthonormal set of functions w,(k), where + 
is a discrete index: 


= = £ VWI) = up, (6.96) 
X ve) = (2) — &’) (6.97) 
and define 
a, = ee (on)? wa(k)atk). (6.98) 
Then 
[a,, 4g } = dap, (6.99) 
so that we can apply previous results and construct |n,---1,- he ). But these 


States may not be eigenstates of H. The states 
|k) = a*(k)|0), |k, k'S) = a*(k)a*(k’)[0), 


and so forth, though unnormalizable, are eigenstates of H. 

What kind of phonons do that unnormalizable states represent? The state 
|k> is a phonon of energy fiw(k), and we may also say that it has momentum 
hk. To discover the reason for this, consider the operator 


d*k 


+ 
(n)3 hka*(k)a(k), (6.100) 
which satisfies 
[P, a*(k)] = hka*(k), [P, a(k)] = —hkatk) (6.101) 
so that 
Plk,, &a,--.> = (hk + fiky +°++)iky, kay...,)- (6.102) 
Now, from Eqs. (6.94) and (6.101) we obtain 
[P, o(x)] = ifVe(x). (6.103) 
One can then show that 
et Plitg(ye—4 Pilih = (x + a), (6.104) 


so that P generates translations and is therefore the momentum operator. 
In Sections 6.7 and 6.8 we will describe further the relation between the 
operators and the states they create and destroy, as well as how other operators. 
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guch as the Hamiltonian, can be written in terms of the creation and destruction 
operators using any basis of states. 


Note: In relativistic quantum mechanics, when one quantizes a free-particle 
field with w(k) = Vk? + m? (h = c = 1), a different normalization and 
gummation convention is commonly used for momentum states. Everything is 
written in terms of { d°k/(2x)*2w(k) and (27)?2w(k)5*(k — k’), which happen 
to be relativistically invariant; to accomplish this change of normalization one 
uses a “relativistic” a(k) equal to \ 2@(k) times our a(k). Equations (6.92), 
(6.93), and (6.94) then become 


[a(k), a*(k’)] = (22)°20(k)d°(k — k’); 


_{ ak + 

H = | noe olka *(k)a(k); 
@) =]. a [a(k)e™'* + a*(kje*'*] 
RE Leo col 


In some texts the (27) is also treated differently. We will not use the relativistic 
normalization here, but it is mentioned in case the reader finds it elsewhere and 
wants to reconcile it with our notation. 


6.7 SYSTEMS OF INDISTINGUISHABLE PARTICLES 


In the preceding sections we considered the quantum states of an oscillator 
system as being states of various numbers of a “particle” called a phonon. We 
identified certain states as one-phonon states, and others as states containing 
more than one phonon. 

In this section we shall follow a different line of reasoning. We will start 
with a space of states describing a single particle, either Bose or Fermi, and 
Construct the multiparticle states according to standard methods. For the Bose 
Case we will arrive at a system of states and operators that is mathematically the 
Same as that found previously for a system of oscillators, thereby showing that 
the interpretation of oscillator states as many-phonon states is consistent with 
the usual description of many-particle systems. In the meantime we will have 
also developed a formalism for dealing with Fermi particles, for which the states 
do not resemble those of a harmonic-oscillator system. 

We will treat the Bose and Fermi cases simultaneously, distinguishing them 


by the number C: 
e +1 ifthe particles are Bose 
= (6.105) 


—1 if the particles are Fermi. 


We will use the symbol £? (where P is a permutation) to denote | for the Bose 
fase and (— 1)? for the Fermi case. 
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Consider first the case of distinguishable particles. If |W,)>,..., |W,> are 
one-particle states, then 


lW> = IWa>lW2>- +> IW (6.106) 


describes the n-particle state with the ith particle in state |~,>. If jo) = 
lp;>|~2>- ++ |@,>, then we write 


Kol = (eri eal > (Pal CWi>lW2>> ++ Ia) 
= (Pili ><@2lha>* + COnlWn>s (6.107) 


which defines the inner product <y|y>. The Hilbert space describing the 
n-particle system is that spanned by all mth-rank tensors with the form of 
Eq. (6.106). 

The state in which the ith particle is localized at the point x; is |x,>|x.)°-- 
|x,). As each x; runs over all space, the resulting states form a complete 
orthonormal set for the n-particle space (ignoring spin and other variables): 


(<x, [> ee Cx yr>: re IYn>) >= H(x, x yi): ty d(x, = Vn)s (6.108) 
[ar ae [xt + tay Caf Cel) = 1, (6.109) 


Using this basis we can express the n-particle states in coordinate representation: 


W(X 15.625 Xa) = (Xl Cel. (6.110) 
For the particle |) of Eq. (6.106), we have, using Eq. (6.107), 


Wey) = Valera Ds (ep) 
where W(x,;) = <x,|W,>. 


Next, let us consider indistinguishable particles. We assume that the particles 
obey Bose or Fermi statistics, which means that we must symmetrize or antl- 
symmetrize, respectively, the states obtained in Eq. (6.106). We therefore define 


Wid x [Wed x x [Wad = Qa!) Dive dibeey> ** Wen) (6.112) 


where P runs through all permutations of n objects. It will often be convenient 
to write IW, Wr, erga Un» for wi> x lv2> MR as Wn» 

The space of n-particle states is that spanned by all “‘products” of the form 
of Eq. (6.112). Note that |y,> x--- x |y,)> is totally symmetric in the Bose 
case and totally antisymmetric in the Fermi case, as it should be. 
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example: Let ja) and |b) be two single-particle states. If ¢ = +1 (Bose 
particles). 


la) x |b) = |a, b) = 15 (larle> + |byIa)). 


‘ 
la) x Ja) = Ja, a) = V2Ia Ia). 
If ¢ = —! (Fermi particles) 
la, b) = | (ja)\b) — 1b)\a)), 
V2 
la, a) = 0. 


Thus, we have the expected result that two Fermi particles cannot be in the 
same State. 

What is the inner product of two of these -particle states? The answer is 
given by the following theorem* 


pa ie: = CPilvi> 4 {9 lvn> (6.113) 


Pivcscn Olbises CPnlWi> +02 (Palade 


where, for any n xX n matrix A = (A;,), 

|Al, = » CPAs pc) 1 An Pin): (6.114) 
That is, |A|_ is the determinant of A, and |A|, is what is often called the 
permanent of A. 


Proof: 


(9, een DalWys eee, Wn> = 2B OM Pra ie (Prim MiWocy> ag lWorny>) 


I 
n! 


1 . 
nl ap? CPC op ylWoay> i CPpmlWocn> 


=e VS POG nas Colanig® 


n! PO 


(where we have permuted the factors by P) 


1 vop-1 
> ney (PilWor- tay) *** PalWor- Hin) 


~~. 


* is ; : 
Compare this theorem with the well-known formula for vectors in 3-space, 


oo 
b-cb-d|- 


(a x b)-(e x d) = 


170 Creation and annihilation operators 


(since ¢? = ¢P' and ¢2¢P* = (2? ') 


“ » » CPG ilWrcay> “+ Palen) 


(letting R = QP~') 
= » Ca lweay> a= CPalW rin? 


= Keil 
which is the desired result. 
Now let {|1), |2),...} be a complete orthornormal set of states: 
<alB> = 5p; Y IaxXal = 1. (6.115) 
A complete set of n-particle states consists of |x,, %,..., %,, where a, <--- 
< a, in the Bose case, and a, <---+ < a, in the Fermi case. These states are 


orthogonal to one another, but not always normalized. The reader can show, 
using Eq. (6.113), that for a complete orthonormal set of states we may take 


O1,.--,0 
lorry ++ Om) (a, <°:: <4,) for bosons 
Vn, ! nz! 
|&1,-.-5 (4, <***<a,) for fermions, 
where 7, is the number of times that « occurs in the sequence a,,..., %, (for 


Fermi particles, 7, = 0 or 1). 
For either case the completeness relation can be written in the following 
convenient form: 


ES fee lie, as epee © (6.116) 


n! ay 


Here the range of each a; is unrestricted, duplication of states being taken care 
of by the I/n! and the normalization. In the Fermi case, the terms with non- 
distinct , are, of course, zero. The | on the right-hand side of Eq. (6.116) 
means the unit operator on the space of (properly symmetrized) n-particle states. 
Equation (6.116) can be verified by applying the left side to a state |f,.... Bn» 
and using Eqs. (6.113) and (6.114). 

The case n = 0 may require some explanation. The zero-particle states are 
tensors of rank zero, that is, scalars (complex numbers). They form a one- 
dimensional space, all of whose elements are proportional to the number l. 
The “state” 1 will be denoted by |vac) (or sometimes by |0>) and called the 
“vacuum state.” The zero-particle states are thus spanned by the state |vac)- 
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We have constructed, for each n, a Hilbert space that describes a system of n 
rticles; thus we have an infinite sequence of spaces. In many processes the 
gumber of particles is not constant: particles can be created and destroyed. To 
describe such processes we need a Hilbert space that contains states of varying 
gwmbers of particles. To get such a space we simply combine all the n-particle 
spaces into one big space that we may call the multiparticle space. A general 
state in the multiparticle space is of the form 


W> = [WO + [WPS + WO) + WO) 40>, (6.117) 


where |W”) is an n-particle state. 

We define states of different numbers of particles to be orthogonal to each 
other, so that if |~) is another multiparticle state and is expressed in the manner 
of Eq. (6.117), then 


Lolb> = <p [WO + CoP) +---. (6.118) 


If {|a)} is a complete orthonormal set of states, so that Eq. (6.115) holds, 
then using Eqs. (6.113), (6.115), and (6.118) we may summarize orthogonality 
by 


Cay os OyLBas +s Bad = Sam [OM "7 Petml (6.119) 
OnB 1 Cann g 
From Eq. (6.116) we also have the completeness relation 
— il 
x 7 Dee ei aie et Oia, ee, =e (6.120) 


In this equation ‘‘1”’ means the unit operator on the whole multiparticle space. 

As an example, suppose we describe the states in coordinate representation. 
The (unnormalizable) state |x,,..., x, describes the situation in which there 
is one particle each at points x,,..., x,. Then Eqs. (6.119) and (6.120) become 


B(x, — Wi) 1 P(X, — Yn) 

(E1566 KglVis ee > Vy = Onn : : : ud Ie (6.121) 
B(x, — 1) °° Oy — We 

ed aes a | dadey.. Wiad (e192) 


We may expand an arbitrary multiparticle state |) as follows, using Eq. (6.122): 


>= ¥ + | ax, oe Ce ee (6.123) 
n=0 


Here 
W(X, 4 Xy) = Cty... 5 talW) (6.124) 
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is (if | is normalized) the amplitude for the state |W) to have x particles, ong 
at each x; Note that ¥(x,,..., x,) is symmetric or antisymmetric accordin 
to the statistics. Note also that if |y> is an n-particle state and is of the form 
lW1...-, W,), where each |w;) is a one-particle state, then 


Wo(x,,...,X,) = O unless m =n; 
(n) . _ Wilt1) °° Wyx,) (6.125) 
EP Gi: date ele 


where W(x,;) = <x,lWi>. Equation (6.125) follows from Eqs. (6.124) and 
(6.113). In the Fermi case the determinant is called the “Slater determinant,” 

We are now ready to define creation and destruction operators. These 
operators are fundamental for two reasons; first, we constructed the multi- 
particle states so that we could describe changing numbers of particles, and we 
need some operators that can effect this change, and second, other operators, 
such as the total energy, will turn out to be simply expressible in terms of the 
creation and destruction operators. 

Let |p) be any one-particle state. We define a*(¢) to be that linear operator 
which satisfies 


a*(p)lWy,---, Ua» = 10, W1,---s VD (6.126) 


for any n-particle state |~,,...,w,)>. For n = 0 this is understood to mean 
a*()|vac) = |p). We call a*(g) the creation operator for the state |p), and 
its adjoint a(q) the destruction operator. 

A creation operator clearly converts an n-particle state into an (n + 1)- 
particle state. It is easily seen that a destruction operator turns an n-particle 
state into an (n — 1)-particle state and annihilates the vacuum state. To find 
the effect of a(@) on an n-particle state |, -- + w,)> we multiply on the left by an 
arbitrary (m — 1)-particle state (y,-- + 7%,—11- 


{u Xn lap) +> Wad 
= yo° wla* (Px °° eee 
= wi Nf Wrl®s x es ie 


Wile bila oo Wile |* 
Wl @><WalX1> aoa CWal¥n—1> g 
n <wile> aan CWilen—1>| * 
= {8 o-oo) (n0 Wa) } 
hod CWal%a> ae uel 
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(an expansion by minors) 


YPM alOy* Gs (0 ad alts tan ™ 


py oli nt tee An— 111 tes (no Wy) on Un» 


Because this is for arbitrary (7, -- - z,-1| we have finally, 


apy vad = > Col diy + (m0 Wy) ++)» (6.127) 


Thus the destruction operator removes the states |y;>, one at a time, leaving a 
sum of (n — 1)-particle states. In the Bose case (€ = !) the terms all have 
a + sign, whereas in the Fermi case ({ = —1) they alternate in sign. 

Equations (6.126) and (6.127) describe the action of creation and destruction 
operators on many-particle states. From Eq. (6.126) it follows that 


a*(p,)a*(p2) = Ca*(pz)a* (gy), 


or 
[a*(91), a*(y,)]-, = 9, (6.128) 
where [A, B]_, = AB — (BA; that is, [AB]_(¢ = +1) is the commutator 
and [4, B],(¢ = —1) is the anticommutator. 
Taking the adjoint of Eq. (6.128) we obtain the further result 
[a(p,), a(p2)]-< = 0. (6.129) 


Thus, the creation operators commute for Bose particles and anticommute for 
Fermi particles, and similarly for destruction operators. 

Now, what is [a(@,), a*(p2)|-,? Does it (or any similar expression) 
teduce to anything simple? We first calculate 


a9, )a* (pb, Wy 
= ap, )\(P2, Wi Wad 


= (ilo) Wiad +S CX oWadOas Ys (0 Wa)“ Wad~ (6.130) 


and then 


a*(pz)alo,)W; a JA 


by Cail a* (oli (no Wy) *- Wb,» 


li 
se 
ips 


Oo Ibis lp. Wy ++ (0 YH) --- Wn 
(6.131) 
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Multiplying Eq. (6.131) by ¢ and subtracting it from Eq. (6.130) we see that 


[a(¢,), a*(@)\-¢ = (9,92). (6.132) 


Equations (6.128), (6.129), and (6.132) are the fundamental “commutation” 
relations for creation and destruction operators. 

The relations we have derived are usually stated in terms of an orthonormg} 
basis, and we shall now do that. Let {]a>} = {j1, |2>,...} be a complete 
orthonormal set of one-particle states. It is usual to let a, = a(x). Then, since 
<a|B) = 5,5, we have [a,, ag |_, = 5,5. We consider the Bose and Fermi 
cases separately. 


Bose Case 
Let 
Bocce ds 2s eae 
ed oe bee ee 
Vng!n!+* 
where n, is the number of times « appears in the ket on the right. Then the 


|nyn.* ++) (eachn, = 0, 1, 2, 3,...) form an orthonormal basis for the whole 
multiparticle space. From Eqs. (6.133), (6.126), and (6.127) we find 


(6.133) 


as |nyngs ++ ng?) = Vn + Ainge sng tees), (6.134) 
Ag\ny Ny? + Mgt) = Ving |nyng ng): (6.135) 

The commutation relations are 
[4,. 4g] = [az, ay] = 0; [a,, af ] = Sap. (6.136) 


Equations (6.134), (6.135), and (6.136) are identical to Eqs. (6.60), (6.61). 
(6.51), and (6.52) for raising and lowering operators for a system of harmonic 
oscillators. The operator for the number of particles in the state |a) is 


N, = aja,. 


The notation of Eq. (6.133) (in terms of “‘occupation numbers”) is generally 
not the most convenient. It is more natural, in fact, to continue using the nota- 
tion we have been using all along in this section, that is, the notation |a,, #2,..-- 
a,»>. This notation was discussed in Section 6.5 in connection with phonons. 
Note that Eqs. (6.126) and (6.127) of this section, when applied to states of the 
form |a,,.--,%,), become identical to Eqs. (6.67) and (6.68). Thus, creation 
and destruction operators for a system of Bose particles !ook just like those for 
what we called a “phonon” system. 


61 Systems of indistinguishable particles 175 


Fermi Case 
Using the notation |z,,..., %,), we have 
Gy lo... 25 b> = [L081 - 2 5 Med (6.137) 
and 
GN 0 es hy CY gs lies Beas Sees see (6138) 


k=1 


We could also use the occupation-number notation 
|a;ny--+> = |oy, y= > 


where «, < «%, <---, and, is the number of times « occurs (n, = 0 or 1) in 
this sequence. If n, = 0, then at changes it to 1, whereas a, annihilates the 
state. If, = 1, then a, changes it to 0, whereas a} annihilates the state. There 
are also factors of +1 involved, depending on what other states are occupied. 
It is easiest merely to remember Eqs. (6.137) and (6.138). 

Note that a(~)? = a*(g)? = 0 for any one-particle state |p). This state- 
ment follows from Eqs. (6.128) and (6.129) (with € = —l and 9, = @2 = 9), 
and it is also equivalent to the fact that two fermions cannot be in the same state, 
that is, |g, o> = 0. 

One could also derive Eqs. (6.137) and (6.138) directly from the anti- 
commutation relations 


[a,, ag }4 = [az a5 | = 0; [@, az |+ = Ong (6.139) 


as was done in previous sections for the a,’s of the harmonic-oscillator system. 
But for the Fermi case there does not appear to be any a priori reason for pos- 
tulating Eq. (6.139). (Remember, for oscillators the corresponding commutation 
Tules followed from the canonical quantization procedure.) One may rather 
Consider Eq. (6.139) as derived from the antisymmetrization postulate for 
fermions. 

Let us return to the general case where Eqs. (6.126) through (6.132) apply. 
One advantage of deriving them in such a general form is that we are not tied 
down to a particular basis of states. Suppose we use a basis of momentum 
eigenstates, |p). Because ¢p|p’> = (2n)°6°(p — p’) we have 


[a(p), a*(p')}-, = (27)°5°*(p — p’), 
La(p), a(p')]-; = [a*(p), a*(p'))-; = 0. (6.140) 
From the vacuum state we can construct the other states by 


[Pis---» Pn» = a*(p,)*-- a*(p,)Ivac). (6.141) 
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If we use a basis of position eigenstates |x, then since ¢x|x’> = d°(x — x’), 
[a(x), a*(x')]_, = (x — x’), (6.142) 
|X15- 225 X_> = a*(x,)**+ a*(x,)|vacy. (6.143) 


If we use a basis of hydrogen-atom energy eigenstates |n/m), then [a(nim). 
a*(n'l'm')] ~~ = Say Oy Omm's and so on. 

How do the creation and destruction operators change when we make 4 
change of basis? This question is easily answered by noting that if 


In = al¥> + Bled. (6.144) 
then 
a*(y) = xa*(W) + Ba*(9), 
a(y) = a*a(p) + B*a(), (6.145) 


This means that creation operators “‘transform” like kets, whereas destruction 
operators “transform” like bras (because <z| = «*<w| + B* (|). Equation 
(6.145) are readily generalized to infinite series and integrals. Now if we change. 
for example, from position to momentum representation, so that 


|p) = [exo x [eraners 


6.146) 
d*p ey eet ( 
|x) (anys P<? »= lee lp, : 
the creation operators are related by 
a*(p) = [ser eoer 
(6.147) 


ds 
a*(x = P a* yen Pee 
) (on)3 (Pp 
To relate the destruction operators a(p) and a(x), simply take the Hermitian 
adjoint of Eq. (6.147). One proceeds in a similar way for any other change of 
basis. 


Exercise: Suppose we have a complete orthonormal set of states |x), and we 
let the “wave functions” of these states be (x|a> = u,(x). Write down the 
formulas for a*(x) and a(x) in terms of a7 and a,, and vice versa. 


6.8 THE HAMILTONIAN AND OTHER OPERATORS 


In the last section we developed a method for describing systems containing 
many Bose or Fermi particles, and we defined creation and destruction operators 
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for the particle states. We will now show that these operators have other uses 
than merely creating and destroying states. 

Suppose 4’) is an operator that acts only on one-particle states. We 
wish to find an operator 4 that represents the “sum of A“? over all of the 
particles.” That is, for any n-particle state 


W> = Ibis Wad = Ihid x 0 x [ad (6.148) 
we want 4|y> to satisfy 


AlW) = AP) x [2d x00 [Wad + I> x AM o> x + x Ln? 
+eeo+ [hid x Ibo) Ko x ADW,. (6.149) 


To see what this means, suppose each |y;) is an eigenstate of 4“! with eigen- 
value a;. Then Eq. (6.149) implies that 


Al) = (a, +:°+ + @,)lp). (6.150) 


For example, if A" is the single-particle Hamiltonian, then 4 is the total energy 
(ignoring mutual interactions, which we shall consider later in this section). 
If A“) is the momentum operator for a single particle, then 4 is the total mo- 
mentum. If 47) = 1“) (the unit operator on one-particle states), then A = N, 
the “‘number-of-particles operator.” 
The desired operator A is easy to find. We first find it for the special case 
AY = |z)<B|. In this case Eq. (6.149) becomes 
Alw> = CBlv i> |e, Wo, ae Wa» + (Bly lw, Ayres Wn» 
ae isee ah CBlvn> Wis Way-+05 &). (6.151) 


Now look back at Eq. (6.131) and notice that when g, = x and y, = ff we 
have 


a*(a)a BW = SO "<Blad lay (M0 Vande ad» (6.152) 
But 


‘ame Watsiexay (no Wids ones Wa» =. Way. eey Wits 4, Witiseess Wa? 


due to the symmetry property of the n-particle state. Using this equation in 
Eq. (6.152) and comparing it with Eq. (6.151) we find 


A = a*(a)a(B) when A= eB: (6.153) 


The generalization of Eq. (6.153) for an arbitrary one-particle operator A) is 
'mmediate. We choose a basis—any basis—of one-particle state |x», and write 


AM = ¥ lad<alA MPSA = SS AGad< (6.154) 
x, 28 
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Then, from Eq. (6.153) and linearity, 
A= » ASa* (a)a(B). (6.155) 


As a first example we consider i = 1", so that 4 = N, the operator for 
the number of particles. Using various bases we have 


1 = \ [aca 
= [asic 


- (on . IP><PI (6.156) 


from which we can immediately write 


N = Yaa, 
7 {¢ 3xa*(x)a(x) 


= les a*(p)a(p). (6.157) 


Next, consider the momentum operator. Because 


peta 
Benes | cpp nled<er 
= [ax - Vix| (6.158) 
U 


we have for the total momentum 


= at yp Pa *(p)a(p) 
= | d3xa*(x) | Va(x). (6.159) 
t 


(Compare the first of these expressions for P with Eq. (6.100), another expression 
for P.) 
Finally, suppose the Hamiltonian for a single particle is 


2 
H® = x + Vix), (6.160) 
m 
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where x is the position operator. In coordinate representation, 
(x|HOlx’) = — V25%(x — x’) + V(x)s%(x — x’), (6.161) 
m 
so that 
H= [as far ‘ - V53(x — x') + V(X)d(x - “| a*(x)a(x’) 


= [owe |- vi + vo) a(x). (6.162) 
2m 
In momentum representation* 


2 . re 
(pH ‘Ip’ =P -(2n)°0%(p ~ p') + | d xe? *V(x)oiP * 
m 


2 
=F (2n)35°%(p — p') + Vip - P’), (6.163) 
2m 
where 
V(q) = | d°xV(x)e~ 4", (6.164) 


Therefore 


3p 
= | oes (2m)? 


pr 
n)> 2m 
= P P , + 
= \on n)3 2m (p)a(p) + (on mele ys Viga*(p + q)a(p). 
(6.165) 


3 L ~ 
a*(p)a(p) + lo: ai 4?’ Op — p)a*(p)alp’) 


(2n)? 


The term W(q)a*(p + g)a(p) annihilates a particle of momentum p and recreates 
it with momentum p + q, the amplitude for this process being V(qg). If we use a 
basis of eigenstate |x) of H, so that 


(alH |B) = E,Sag (6.166) 
then 
H =) E,a; a,, (6.167) 


which is what we had for phonons with E, = fia 


‘a? 


* The second term is obtained by noting that 
V(X,) = [@xvene 


and putting this between states <p| and |p’). 
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All of these expressions for H can be derived from each other using the 
formulas relating a* (x) and a*(p), and so on, described at the end of Section 6.7. 
But it is often simplest to obtain such expressions directly from the single- 
particle Hamiltonians, as we have done here. 

The density of particles (that is, number per unit volume) at the point x 
is given by the operator 


p(x) = a*(x)a(x) (6.168) 


(which corresponds to the one-particle operator |x)<x|). Thus the number-of- 
particles operator in Eq. (6.157) may be written as 


N= [a°xoc9, 


and the potential-energy term in Eq. (6.162) as 


V = [arxveonee 


This last equation represents the integral of the potential energy weighted by the 
density. 

So far we have described a system of independent particles, each particle 
being (possibly) acted on by an external potential but no two particles influencing 
each other. Suppose, however, that there is an additional potential V)(x;, x;) 
between any two particles at x; and x, (giving rise to a “two-body” force). We 
assume that V(x; x;) = V@(x;, x;). On two-particle states the operator 
is then 


y2) = 4 | d?x | d®ylx, yVO(x, yx, yh (6.169) 


as can be verified by applying it to a two-particle state |x,, x.>. We now want 
an operator V on the whole multiparticle space such that 


Vix,,.--,%,) = Pol V x55 ie ioe Bap 
i<j 
= 4} VO, x) ties. Xa (6.170) 
‘Fy 


Looking at Eq. (6.169) and noticing that a*(x)a*(y) creates the state |x, y). 
whereas a(y)a(x) destroys the same state, we might guess that 


vV=4 favs [aver eoarnv%e y)a(y)a(x). (6.171) 
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This is in fact correct, as can be verified by applying V to [x,,..., x,)>. Using 
Eq. (6.127) twice, we have 


a(y)a(x)|x1 °° * Xn) 


= a(y) CK-163(x — x,))x, -- > (no x,) °° x, 


k 


ay CSR (x — X) x NiO (Y oe x,)|X; "++ (no X%, x;) pete Xn» 
j= 


k=1 


im: 


j#k 
where 
he if j<k 
mati if 7 > k 
Then 
a*(x)a*(y)a(y)a(x)|x1,---, Xn) 


= yi Cn O(x — x) O(y — xX, y, X1,---, (NO Xp, Xj),--., Xa 
J 


= » On O(x —_ x)O*(p oo Xj)1X,, Xj, pens, (no Xk, x;), area X,) 
= Ye = 28% — xpey aH). 
Jj 


Multiplying by $V x, y) and integrating over x and y, we find that V as 
given by Eq. (6.171) indeed satisfies Eq. (6.170). 

In view of Eq. (6.168) and the remarks following it, we might expect that 
the mutual interaction could also be described in terms of the particle density 
by the operator 


viet far [¢ SyV Xx, y)o(x)p(y). (6.172) 


However, V’ is not quite the same as V. To see the difference we write 
p(x)p(y) = a*(x)a(x)a* (yay) 

Ca*(x)a*(y)a(x)a(y) + d°(x — y)a*(x)a(y) 

= a*(x)a*(y)a(y)a(x) + 3°(x — y)a*(x)a(x), 


| 


so that 


Vi=VvVH+4 [eevee x)p(x). (6.173) 


Thus V’ contains an extra term, which may be interpreted as a self-energy; 
It contributes even when there is only one particle present. The true mutual 
Interaction V is zero unless there are two or more particles. We want only the 


182 Creation and annihilation operators 


Fig. 6.1 Operator V adds momentum g to one particle and subtracts it from the other, 


mutual interaction, because any self-energy (if it exists in nature) can be included 
in the Hamiltonian of Eq. (6.162). Besides, for many potentials (for example, 
the Coulomb potential), V’ is infinite and is not what we would consider to be 
the true energy. 

If we express V in momentum representation, using Eqs. (6.171) and 
(6.147), and assume 


V(x, y) = Vix — y) = Vy — x), (6.174) 

we arrive at 
= 1 d>q d3p d>p' 
2 } Qn)? J (22)? } (2n)° 


Here V(q) is as defined in Eq. (6.164) (in proving Eq. (6.175) we use Vg) = 
W(— q), because V(x) = V(—x)). Note that 


Viqg)a*(p + g)a*(p' — q)a(p’)a(p). —(6.175) 


d*q « 
VIPs, P2> = le Vip: + 9% P2- 9; 


which says that V adds momentum g to one particle and subtracts it from the 
other with amplitude V(q). This process is denoted by the diagram in Fig. 6.1. 
The total momentum is conserved, as we would expect, because Eq. (6.174) 
implies that the mutual interaction is invariant under translations. 

The Hamiltonian and mutual interactions described here all conserve the 
total number of particles (see the exercise at the end of Section 6.5). 


6.9 GROUND STATE FOR A FERMION SYSTEM 


In this section we will consider fermions, so that the results of Sections 6.7 and 
6.8 apply with ¢ = —1. Suppose we know the eigenstates and eigenvalues of 
the single-particle Hamiltonians, H™: 


Hla) = Ela), (a = 1,2, 3,...). (6.176) 
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Then, ignoring mutual interactions, we have for the multiparticle Hamiltonian: 


H = Y Eg: ax (6.177) 


Every state may be built up from the vacuum state by applying creation 
operators: 
[eisce as See a we Ge aes (6.178) 


(Both sides vanish unless the a; are all distinct.) 

It is often inconvenient to refer everything to the vacuum state, as in 
Eq. (6.178). In practice we may be considering states that differ from some 
“ground” state only by the presence or absence of a few particles. Suppose 
there are G particles present. Assume that the energy levels are ordered such 
that 


Behe Bocas 


Then the state of lowest energy is 


lgnd> = |1,...,G) = af --- ag|vac), (6.179) 
which we call the ground state; its energy is 
Ema = Ey, + +++ + Eg. (6.180) 


Any other G-particle state will have some of the levels 1,..., G unoccupied 
and some higher levels occupied. It is convenient to use |gnd) as a reference, 
describing the removal of a particle from 1,..., G as the creation of a “hole.” 
Particles in the levels G + 1,G + 2,... are still called ‘‘particles.” If a 
particle is excited from the level « to the level B(a < G < f), then we say that a 
hole with energy — E, has been created as well as a particle with energy Ey. 

The concept of holes may be formulated mathematically as follows: 
Define 


b, = az for «a <G. (6.181) 


a a 


From the anticommutation relations in Eq. (6.139) we have, if «, « > G and 
B, B’ < G, 


[a,. a, 4 = [a,, bg ]s aaa [b,, by] 4 z= 0, 
(ay, 47 J4 = Saar [b,, bg: | = Opps (6.182) 
[a,, by ]4 = 0. 


Thus the operators aj (a > G) and bj (a < G) behave like creation operators. 


We can now express the states in the form 


[eis ists Bises<5 Bas Bnd) Sass ap by. 2? bg ipnd> - (6.183) 
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(a; > G,B; < G). We call this a state with m particles and n holes. The 
ground state acts like a vacuum state in that 


a,|gnd) = b,|gnd) = 0. (6.184) 
The Hamiltonian of Eq. (6.177) may be written in the form 
H= Y¥ E,aja,+ ¥, E,b,bz 
a>G 2<G 
L E,aza, + y Eb, 6, 4:1) 


= Ema t+ Y E,aza,— Y) E,byb,, (6.185) 
‘s a>G 


a<G 


so that the energy of a hole in state « is — E,. In other words, it takes an energy 
— E, to create the hole. The number-of-particles operator is 


N = VY aja, 
= ¥ afa,+ ¥ (-bfb, + YD 
a>G a<G 
=G+ ¥ afa,- Y bzb,; (6.186) 
a>G a<G 


thus a hole counts as —1. The number of particles and holes outside the ground 
state (that is, the number m + n in Eq. (6.183)) is given by the operator 
N'= ¥ ava,+ ¥ bfb,, (6.187) 
a>G asG 
which counts a hole as +1 and the ground state as nothing. 
Suppose now that a perturbation is applied to the system in the form of an 
external potential 
U = ¥ UDasa,, (6.188) 
2B 
where we assume that the single-particle potential U{{) may be nondiagonal. 
Then 


1),+ 1),+ 
U = Uap a. a, + Y USpasbs 
a> 


a>G 
B>G BSG 
+ Y US ba, —- Y UPbsb, + Y UL. (6.189) 
a<G a<G a<G 
B>G B<G 


The first and fourth terms of this equation modify the energies of the particles 
and holes, respectively, and the fifth term modifies the ground-state energy- 
The second term creates particle-hole pairs, and the third term destroys them. 
Note that N, as given by Eq. (6.186), is conserved, whereas N’, as defined in 
Eq. (6.187), is not. 
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Suppose further that there is a mutual interaction, a two-body potential 
of the form described in Section 6.8: 
V= : 2 : VQ sAz ag a5a.,. (6.190) 
As in Eq. (6.189), we can express V in terms of a, (x > G) and b, (x < G), and 
then use the anticommutation rules of Eq. (6.182) to write V in a form such 
that in every term all of the creation operators are to the left of all of the de- 
struction operators. Such terms are called “normal products.” The result 
will be a number of terms involving four operators, plus other terms that can be 
lumped into U or added to E,ng. 


6.10 HAMILTONIAN FOR A PHONON-ELECTRON SYSTEM 


Now we will consider, as an application of the creation and destruction operator 
formalism, the interaction between the electrons and lattice vibrations in a 
crystal. It is this interaction (as well as the presence of crystal impurities and 
imperfections) that accounts for the finite conductivity of metals under most 
conditions. (One might otherwise expect from the band theory of metals that, 
once an electron got into an unfilled band, it would move unhindered, resulting 
in infinite conductivity.) We will show how the Hamiltonian is derived and 
written in terms of creation and destruction operators. 

We start with the electron in a lattice with no vibrations. Let N = a,a, + 
N2@, + n3a, (where n,, n2, n3 are integers) describe the positions of the nuclei. 
The potential V(x) felt by an electron in the lattice is periodic and has the form 


V(x) = > Vo(x — N). (6.191) 
N 
The Hamiltonian for a system of independent electrons in the lattice is, from 


Section 6.8, 


He | d? xa* (x) |- ia ae rc | a(x). (6.192) 
m 


Now, suppose we have solved this part of the problem and know the one- 
particle eigenstates |~) and eigenvalues E,. Denoting <x|x)> by @,(x) we have 


|- te V+ ria) | g{x) = E,9,(x). (6.193) 
2m 


Then we can express the electron Hamiltonian as 
Hay bara: (6.194) 


Because 


|x) = 2 la><a|x> = 2 lx>pz(x) 
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we have 
a*(x) = ¥ az eF(x) (6.195) 
and similarly 


aj = [xeon (6.196) 


To obtain relations between the destruction operators, simply take the Hermitian 
adjoints of Eqs. (6.195) and (6.196). 

If, as in Section 6.9, we refer all states to a ground state (called the “electron 
sea’’), which is normally filled, then in the notation of that section, 


Ha = Ena + ¥ E,aga, — Y E,bf by. (6.197) 
a>G 


a<G 


In what follows we shall ignore the ground-state energy, so that the electron 
Hamiltonian becomes Hj: 


a= > E,aza,— Y E,bz b,. (6.198) 
a>G a<G 


Note that, in terms of electron and hole operators, Eq. (6.195) and its adjoint 
become 


a*(x) = Yi azox(x) + Yo bez), 
a>G a<G 


a(x) = D> GPX) + p> bs p(X). 


(6.199) 


Consider next the lattice vibrations. For each N let Zy be the displacement of 
the corresponding nucleus from its equilibrium position N. The Z, form a set 
of coordinates for a system of harmonic oscillators. The procedure for finding 
the normal modes and quantizing this system is similar to the field quantization 
of Section 6.6, except that Fourier transforms are replaced by Fourier series, 
with the “momentum” & of the phonons running over a limited region. The nor- 
mal coordinates q,,, are related to the Zy by equations of the following form 
(assuming for simplicity that ‘‘a” runs from 1 to 3 as in the case of one atom per 
unit cell of the crystal): 


ds = 2 | asta : Zn, 


(6.200) 


N 
Il 
1h 


Here the region of integration is K = {k| — x <k-a; <1; i = 1, 2, 3}; 
V = |a, x a) - a,| is the volume of the unit cell of the crystal lattice (recall that 
the a; are vectors describing the periodicity of the lattice), and e, , (a = 1, 2, 3) 
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form an orthonormal basis of 3-space for each k (in that ef: e,4 = 5g), 
chosen so that the mode (k, a) is a normal mode. 

You should check the consistency of Eqs. (6.200) as an exercise. It is 
convenient to use the relation 


@k giK(N-N’) 
K (2x)? 


and the easiest way to verify it is to make a change of variables r; = K-a;. 
N — N' can be written as 


ON,N’> 


where m;, are integers, so that K-(N — N’) = m-r, and the Jacobian of the 
transformation is 


Equations 6.200 are essentially a special case of Eqs. (1.28) and (1.29). The 
change of notation is 


Chapter 1 Chapter 6 
Q,(k) Ter 
VV 
a’(k) The ath component of e,, 
Za,N The ath component of Zy 
af v 
n 


The normalization conventions used in Eq. (6.200) are convenient because, 
In the limit that the lattice approaches a continuum (i.e., a; > 0), K becomes all 
of momentum space and, letting x = N, we have 


VyY= [es and 7 uw > d(x — x’), (6.201) 
N 


Thus we have obtained the normalization of Section 6.6 for a field (x), where 


VMIV Zy > 9(x)- 
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Following the procedure of Section 6.6 we have an expression for Zy in 
terms of creation and destruction operators: 


y= | Sa y (a ik EA Mea + AN, ae Meg 
(6.202) 
where 
[A(k, a), A*(k’, a’)] = Qn) 5%(k — k5,9°- (6.203) 


M is the mass of the vibrating atom, and w(k, a) is the frequency of the cor- 
responding mode. The Hamiltonian for the oscillator system (apart from a 
constant term) is 


Hose = lee ee KY halk, a)A*(k, a)A(k, a). (6.204) 


(We have used A for the phonon operators* to avoid confusion with the electron 
creation and destruction operators.) 

Having written down Hamiltonians for the electrons and the phonons 
(lattice-vibration states), we now turn to the interaction between them. The 
potential energy of an electron in an undisturbed lattice was given by Eq. (6.191). 
If the nucleus at N is displaced by an amount Zy, then the potential energy 
changes to 


V(x) + AV,(x) = ¥ Vox — N — Zy), (6.205) 


where we have assumed that each atom in the crystal acts like a rigid body 
when it is displaced, so that the potential V, arising from it is simply displaced 
by Zy. (In practice, not all the electron shells around the nucleus move by the 
same amount, so that the potential changes its shape as well as being displaced; 
however, we shall ignore this fact.) The potential V,(x) was already included 
in H,. Writing 


Vox — N — Zy) © Volx — N) — Zy-VVo(x — N) 


we have for the interaction energy (that is, the extra energy of the system due to 
displacement of the lattice): 


AV,(x) = ~ ¥ Zu Wolx — N). (6.206) 


* Note that A*+(k, a) (as well as the state it creates) has the dimensions of (length)?/7, as 
do the corresponding operators for other particles in momentum representation. 
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For the many electron system the interaction Hamiltonian is 
Hint = | d>x AV,(x)a*(x)a(x) 
= Y Yost." (6.207) 
where 


ye fe xoh(x) AV; (x) 9(2) 


N 


- V2 ‘fa > xpF(x)WVo(x — N)e,(x). (6.208) 


Now V,, is also an operator on phonon states, because Zy is an operator. 
Using Eq. (6.202) we 24 


&, 


=) EY (Cyplk, a)A*(k, a) + CE(K, aA(k, a], 6.209) 


ne 
where 
Cap(k, a) = ~ J Le Coa De® | dP xpx(x)VVolx — N)og(x). 
2Ma(k, a) 
(6.210) 
Therefore, 
Hin. = a m3 Sy LCaslks DAs a) + Chalks aACk, a))az ay, (6.211) 
nu a ap 
The total Hamiltonian of the system is 
H = He 5 Ags “iE Hint (6.212) 
Note: The states of our system are of the form 
Js. 0 65 Ons Ky, Gy, --- 5 Kins Am» (n electrons and m phonons), 


or, if we use the hole notation, 


[Bis ay Ons Biss e335 B,,3 ky, Biywicey Ms a,» 
(a electrons, m holes, and p phonons). 


The effect of a creation operator 4*(k, a) on such a state may be defined as 
A*(k, a)lelectrons, holes, k,, a;,...) = |k, a, electrons, holes, a,,... 5 
which is in turn defined to be 


|electrons, holes, k, a, k,, a,,...). 


190 Creation and annihilation operators 


The result of this definition is that all phonon operators commute with a]| 
electron (and hole) operators. (In general, it is conventional to say that creation 
and destruction operators for different particles always commute unless the 
particles are both fermions. In the latter case it is convenient to define the 
operators to anticommute; then everything is consistent if we decide to call the 
particles different states of the same particle, as with the proton and neutron.) 

If we write Eq. (6.211) in terms of electron and hole operators, we get an 
expression four times as long, involving terms of the form 


Ata‘*a, Atatb*, At ba, A* bb*, 
A ata, A atht, A ba, A bb*. 


These terms represent the transition of an electron or hole from one state to 
another, or the creation or annihilation of an electron-hole pair. In each 
process a phonon is emitted or absorbed. 

The foregoing derivation assumed that there is only one atom per unit 
cell of the crystal. However, the results are similar if there are more atoms 
(say A) per unit cell, the only difference in the final result being that there are 
more phonon modes for a given k (a running from | to 3A). Asa simplification 
it may turn out that some of the modes do not “‘couple” to the electrons, that is, 
they do not influence the potential felt by the electrons; these modes are 
independent of the others and may be ignored. Such is the case in the Polaron 
Problem (Chapter 8), where only one phonon mode contributes for each k. 


6.11 PHOTON-ELECTRON INTERACTIONS 


Suppose we shine light on the crystal of Section 6.10. What is the Hamiltonian 

now? To H we must add a term H, for the free electromagnetic field (y refers 

to phonons) and a term H,, for the interaction between electrons and phonons. 
For the free electromagnetic field, the classical Lagrangian density is 


L = \(E” — c?B’) 
= 4[A? — c2(V x A)*] 
3 
= | 2 AMA, Av Ad, (6.213) 
where A(x) satisfies the subsidiary condition*) 
V:-A=0. (6.214) 
* We assume also that the scalar potential is zero. We are using “rationalized units,” 


but with ¢9 = 1; thus e?/4zfc ~ 1/137 and V-E = p, V x B= (i/e?)(j + GE/ét), 
etc. 
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The equation of motion resulting from the Lagrangian is 


a 2 
0 = [V2A — W(V-A)] — A Salers 
Cc 


| A. (6.215) 


Carrying out the quantization procedure of Section 6.6, we find w(k) = 
clk| = ck, and 


A(x) = = oe y a [C(k, r)e#* + Ck, rye **Je,,. (6.216) 


Here e,,, (r = 1, 2) are two unit vectors, perpendicular to each other and to k, 
and C*(k,r) is the creation operator for a photon with momentum fk and 
polarization e, ,: 


[C(k, r), C*(k', r')] = (22°53 (k — k’)6,,-. (6.217) 
The Hamiltonian is 


H, = ya hckC *(k, r)C(k, r). (6.218) 
je in)? 


To find the photon-electron interaction we replace the operator* P by 
P + eA(x) (the charge of the electron being —e) in the single-particle 
Hamiltonian 


2 
HY = +h, 
2m 


so that 
2 
HY + HL) = (Pt ed) V, 
2m 
e e 
= H{) +—(P-A+A4-P)+—A*. (6.219) 
2m 2m 
Writing 
A(X) = | ateacornrca (6.219’) 
We have 


Hy? = = | d?xA(x) jx) + = | d>xA(x)*\x)<x], (6.220) 
m 


* Pand X are the electron momentum and position operators, and A(x) operates on 
Photon states, so that A(X) (see Eq. (6.219’)) operates on both photons and electrons. 
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where 


JOR) = - 7 (Pix><x| + |x)<x|P) 
m 


- < i[(V[x))<x] — |x)(VCxI)]. (6.221) 
m 


Note that j“)(x) is the charge —e multiplied by the probability current-density 
operator. The expectation value in any one-electron state |) is 


WCW = - <a LYCOWW*(X) — VA). 


By inspection of Eq. (6.220) we have, for the interaction between photons and 
systems of arbitrary numbers of electrons, 


H,, = — [ae -j(x) + < | @xaeytar nate, (6.222) 
m 


where 


jx) = - oo i[Va*(x)a(x) — a*(x)Va(x)] (6.223) 
m 


is the electromagnetic current-density operator. 

In Eq. (6.222), A(x) is itself an operator for each x, given by Eq. (6.216). 
If we use Eq. (6.216) in Eq. (6.222), and also express a*(x) in terms of af using 
Eq. (6.195), we get terms involving 


Crk, ray ap, C(k, raz ag, C{k, r)C(k’, r')ay ag, C(k, r)C*(k’, r')az ap. 


and so forth, which have interpretations similar to those of the phonon- 
electron interaction. 

The electromagnetic current defined in Eq. (6.223) may be expressed in 
momentum representation as follows: 


. da ’ a , Sip rietiite 


This expression will be useful in Section 6.12. 


6.12 FEYNMAN DIAGRAMS 


A graphical method employing what have come to be called ‘““Feynman diagram a 
has proven to be very convenient in dealing with perturbation solutions of 
complicated Hamiltonians. These diagrams serve as a “bookkeeping” device 
to keep track of all the perturbation terms and a guide in writing down the value 
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of each term. (The diagrams assume their full power in the relativistic case, 
with which we are not concerned here.) To show how the method works we 
will consider the Hamiltonian for a single, otherwise free, electron interacting 
with the electromagnetic field: 


H= Hiree ot Fins 


where 


3 Pp’ 
A ¢ree = lé oe ae a*(p)a(p) ai le 


(on y, : ckA*(k, r)A(k, r) (6.225) 


and 


Hin = — [arse - A(x) + [ax < a*(x)a(x)A(x)*. (6.226) 
m ? 


Here we denote the photon-creation operator by A*(k,r) instead of by 
Ct(k, r): 


A(x) = a . a —— [A(k, ne** + At(k, rye~™'*]Je,,, (6.227) 
and j(x) is given by Eqs. (6.223) and (6.224). In this section, A = 
The states under consideration are of the form 


IPs kis tis... 3 Ks ta (one electron and n photons). (6.228) 


According to standard quantum-mechanical techniques, the transition 
amplitude M;; for a transition from an initial state |i) to a final state |f) can 
be expressed in terms of the Hamiltonian (to second order) as follows: 


(2n)°5%(p, ~ PMy = <ElHliy + D ee aes i> 4 OCH), (6.229) 


Cross sections and transition rates are proportional to the absolute square of 
the amplitude: 
Rate = y |M,,|7(27)*8°( p, — pO(E; — Ej). 
£ 


In practice, the states |i) and |f) are usually states of the form given in Eq. 
(6.228), that is, states of particles of definite momentum. It is therefore con- 
venient to express H;,, in terms of the A(k, r), substituting Eq. (6.227) in 
Eq. (6.226), and to use Eq. (6.224) for j(x). 


Hin = A, a a H, (6.230) 
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where 


1 


_ (dep ( dp [ ak Pi ad 1 
~ | (22) | Qn)? | (22) 2m Ean as 
x [(22)?57(p' — p — k)a*(p')a(p)A(k, r) 


+ (2n)°5°(p’ + k — p)a*(p')a(p)A*(k, r)], 
and 


H, 


_ (dp (dp (ek (Pkyoe 1 ae 
~ J an? J 22)? | 2x) | Oxy 2m Jack J2ck 
x [(22)?5°(p' — p — k’ ~— k)a*(p’)a(p)A(k’, r’)A(k, r) 

+ (272)?57(p’ + k’ — p — k)a*(p')a(p)A* (k’, r')A(k, r) 
+ (22)°53(p' + k — p — k’)a*(p')a(p)A(k’, r’)A* (k, r) 


(6.231) 


+ (2n)25°(p' + ko +k — p)a*(p’)a(p)A*(k', r')A*(k, ry]. (6.232) 


Note that the total momentum is conserved. 


Each of the terms in Eqs. (6.231) and (6.232) is represented by a diagram 
in which a straight line denotes an electron and a wavy line a photon. For H, 


the diagrams are given in Fig. 6.2a and 6.2b, and the amplitude for each is 


For H, the diagrams are those shown in Fig. 6.3a through 6.3d, and the 


amplitude for each is 
A Asc Ds ey 
2m Jrck'V2ck *" 


(pt+k= p’) (p=p'+k) 


(a) (b) 
Fig. 6.2 Feynman diagrams for H,. 
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(p+k+k'=p') (p+k=p’+k’) 
(a) (b) 


(p +k’ =p’ +k) (p=p +k+k’) 


(c) (d) 
Fig. 6.3 Feynman diagrams for H. 


Note that parts b and c of Fig. 6.3 represent essentially the same process. In 
fact, after the integrations are performed, the second and third terms of 
Eq. (6.232) are equal;* so in any process involving such terms we need only 
calculate for the case shown in Fig. 6.3b, say, and multiply by 2. 

Now suppose we have a definite amplitude to calculate, for example, that 
for Compton scattering. The whole process is denoted by Fig. 6.4. To find the 


(Pp, + k,= pot k,) 


Fig. 6.4 Feynman diagrams for Compton scattering. 


— 


*We ignore the infinite self-energy of the electron that comes from rearranging the 
A and At of the third term in Eq. (6.232). 


196 Creation and annihilation operators 


(a) (b) (c) 


Fig. 6.5 Feynman diagrams with the same ingoing and outgoing lines as Fig. 6,4. 


amplitude we draw all diagrams having the same ingoing and outgoing lines as 
that of Fig. 6.4, using those of Figs. 6.2 and 6.3. Thus we get the three cases 
shown in Fig. 6.5a, b, and c. Parts 6.5a and b correspond to the second-order 
term of Eq. (6.229), in which the intermediate state |n) consists of one electron 
and H = H, + O(e?). (The fact that the photon lines cross in Fig. 6.5b is of 
no significance.) Using Eqs. (6.229) and (6.231) and removing the factor 
(27)°53(p, + kz — p; — k,), we get for these contributions to the amplitude 


(=; Pee sled ee )( e Bishi EB g.g ) 
29F2 1sF 1 


M® = J 2ck, 2m V2ck, 2m 
= 2 2 
ES eke, et ey 
2m 2m 
(6.233) 
and 
( €  P2+ Pi ~ kz ' )( € Pi ~ ky + Pi es ) 
uM V2ck, 2m mY \ 2ck, 2m oe 
2 = 2 
Py + ck; (P1 k2) 
2 2m 
(6.234) 


We may associate the two factors in the numerator with each vertex, and the 
denominator with the “propagation” of the intermediate electron. 
Figure 6.5c corresponds to the first-order terms of Eq. (6.229) in which only 
H, can contribute. Remembering the factor of 2 due to the two ways in which 
this process occurs in Eq. (6.232), we get 
MO =2- ai gee he ree ee (6.235) 
2m J2ck,J2cky 
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gi2 


The total amplitude for Compton scattering is 
M = M® + M® 4+ M© + O(e4) (6.236) 
Jn equations (6.233) and (6.234) 
Pi t+ Ay = pr + kp and hi Cir, = Ka Chae, = 9. 
In fact, if the initial electron is at rest (i.e., p; = 0), then M and M® both 


vanish. 
The results we have derived are satisfactory for low photon energies 
(ck, « mc’). For high energies we would have to use a relativistic theory and 


also take into account the spin of the electron. 


CHAPTER 7 


SPIN WAVES 


7.1 SPIN-SPIN INTERACTIONS 


In this section we will consider interactions that give rise to a coupling between 
the spins of two objects of the form 


Heo S,-S). (7.1) 


In later sections we will find out what happens when such a coupling appears 
in a lattice full of spins. 

An interaction of the form of Eq. (7.1) can be due, first of all, to the magnetic 
interaction between two magnetic dipoles. The interaction between two magnetic 
dipoles #, and uw, with separation r is 


. . . 8 
H = eo 23 Gn the) Mia ”) — Fa m8) =H +H" (72) 


where H’ is the term in brackets. (The term H” with 53(r) arises from a corres- 
ponding term in the magnetic field of a dipole, which appears when one evaluates 
V x (wx v)/r?). 

Suppose the two particles are orbiting around each other (as in an atom) 
and we consider the matrix element of H between the two states of orbital 
angular momentum / and /’. As a function of the direction (0, g) of r, H’ is a 
combination of spherical harmonics of order 2; therefore the matrix element 1s 
zero unless |/ — 2| < /’ < / + 2. In particular, the first-order splitting of the 
S-state (I = I’ = 0) is zero. (This argument is just another way of saying that S 
states are spherically symmetric and H’ averaged over all directions is zero). 
However, H” contributes only for S states, because only those have nonzero 
wave functions at r = 0 (a state with angular momentum / varies as r! near the 
origin). Thus, for S states we have an interaction proportional to py ° #2- 
which is proportional to S, - S, because 


Hie = Gi at S; (7.3) 
2m,c 


for a particle with mass m,, charge g; and g-factor g; (Eq. (7.3) in fact 
defines g;)). 
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In addition to direct magnetic interactions, a term of the form of Eq. (7.1) 
can appear indirectly as a result of spin-independent interactions combined 
with the exclusion principle. As an example, consider the hydrogen molecule; 
take the nuclei to be fixed and ignore their spins. The two electrons may be 
described by their positions x, and x, and some component of their spins S, 
and S>. If, and y, are energy eigenfunctions for a single electron with energies 
E, and E,, then 
WaslXi, X2) = Walx )Wy(x2) (7.4) 
has energy £, + E,, neglecting mutual interactions between the electrons. The 
state Wya(X1, X2) = W,(X1),(X2) has the same energy. 
If we add a mutual interaction V(x,, x.) = V(x,,x,) which is spin- 
independent between the electrons, then the two-electron eigenfunctions become 


ail, 


Ws = —- (War + Wa), 
V2 
(7.5) 
! 
Wa om V2 (War = Woa)s 
with their respective energies 
E;=E,+£,+1-J 
(7.6) 


E4 


i 


E,+ E, +14 J. 
Here 


[= [ave d°x, d°x, =, aac d°x, d°x, 
and (7.7) 
~ | vaY d>x, d°x, = — | vieY Ye dx, dx. 


— 
i 


As implied by the notation, W, is symmetric and y, is antisymmetric under 
exchange of x, and xp. 

We have not yet reckoned with the electron spins or the exclusion principle. 
We must multiply our spatial wave functicns by spinors that describe the 
Possible spins of the electrons, and the total wave function must be antisym- 
Metric under the simultaneous exchanges of coordinates and spins. Now for 
two spin-4 particles, the symmetric spin states have total spin 1, and the anti- 
Symmetric states have spin 0 (see Section 7.2). The total wave functions are 
then of the form 


Ws = wsx. Y= vat? (7.8) 


Where y“) is any two-electron spin state with total spin s. (Note that S and A 
Still refer to only the spatial symmetry). In other words, the energy eigenstates 


200 Spin waves 


are also eigenstates of the total electron spin. We can then write the energy jn 
terms of the spin as follows: If S is the total spin, S-S has eigenvalues 
s(s + 1) = Oand 2. The energy of a state with total spin s can be written in the 
form 


E = Es + (E, — Es) 


1 


because it equals E,; when s = 0 and E, when s = 1. Therefore we can write 
the mutual-interaction Hamiltonian for the system in the form 


HE, +(e, - 29 =. (7.10) 
Now, 
S:-S = (S, + S.)-(S, + S.) = 3 + 28,°S,, (7.11) 
because 
§;S;=44+ )=% 
Therefore 


H = Ey + (E, — Es)(% + S; +S) 
HEs + 3E4) + (Ey — Es)S;- Sz 
E,+ Ey + 1+ 4J + 2S, - 8. (7.12) 


In other words, the splitting of the energy levels is described by 
AH = 2JS, - S), (7.13) 


which is of the form of Eq. (7.1). We have thus taken a spin-independent inter- 
action (e.g., a Coulomb potential) between the electrons and made it look like a 
spin-spin interaction. Note that the splitting is proportional to the ‘“‘exchange 
integral” J defined in Eq. (7.7), which depends on how much the wave functions 
W,(x) and w,(x) overlap. Note also that the derivation of Eq. (7.12) depends 
crucially on the fact that electrons have spin 4; for higher spins we would get a 
polynomial in S, - S, if we went through a similar procedure. 

The interaction of Eq. (7.13), due to the exchange integral, turns out to be 
very strong compared to the usual magnetic-dipole interactions. 


7.2 THE PAULI SPIN ALGEBRA 


It is convenient and conventional to use the Pauli spin matrices when dealing 
with a particle of spin 4: 


Re Deel so eee 0\ (7.14) 
1 0 i io 0 -I 


The spin is then 
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where 
o = 0,€, + Gye, + 0,€, (7.16) 


and where we take fi = | (otherwise S = ho/2). Equation (7.14) implies that 
we are using a basis of states in which the z-component of the spin is diagonal- 


jzed; the states 
ieee (6) and |B) = (;) (7.17) 


are states of spin up (S, = 4) and spin down (S, = —4), respectively. From 
Eq. (7.14), we have 


ox|2) = |B>, — |x) = iB), o,|a)> = |x), 


. (7.18) 
o,|B> = |a>, ay |B) = —ila>, |B) = — IB). 
The algebra of the o-matrices is as follows: 
0,0, = —0,0, = i¢,, (7.19) 
and cyclically in x, y, 2; 
6,0, = 6,0, = 0,0, = |. (7.20) 


Alternatively one may use the following elegant form of Eq. (7.20): 

(a: o)(b:- co) = a:b + ia x b-a, (7.21) 
where the a; and 6; are numbers (or other operators, provided that they commute 
with o). 

Consider next a system of two spin-} particles (ignoring spatial and other 
degrees of freedom). As a basis of states we may take 


|ax>, |aB>, |B), BB), (7.22) 


where, for example, |x} is the state in which the first particle has its spin up 
and the second has its spin down. We then define o, to act on the first spin and 
@, on the second. That is, for example, 


o,,|22) = i|Ba>, 
O2y\aa> = ilap). 
Note that o, 05 j = %;0,;, which is to say that operators pertaining to different 
Particles commute. 
We have previously considered (and will consider further in the section to 
follow) the operator S; - S, = 40,-6,. By explicit calculation, we find 
6, °o,\ax)> = jax, 
a, °6,|%8) = 2|Ba> — |aB), 
a, -@2\f2) = 2|xB) — |Ba), 
3, °92|BB> = |B). 


(7.23) 


202 Spin waves 


Writing |xw> = 2|ax) — |ax) and similarly for |Bf, we immediately come upoy 
the useful relation 


G,°0, = 2p? — 1, (7.24) 
where p'’? is the operator that exchanges the spins of the particles; that is, 
pi? |axn> = lax),  p'?|jaB> = |B), and so on. 


The symmetric states 


[1,1) = laa), [1,0) = eae I, —1) = 168) (7.25) 


all satisfy p'?|1, m)> = |l, m> and therefore o,-o5|1,m) = |l,m) (m = 
1,0, —1). The antisymmetric state 


10,0) = 1*8> — 1x (7.26) 
V2 
satisfies p'?(0, 0) = —|0, 0) and thus o,-@,|0,0) = —3{0,0). Therefore 
the eigenvalues of ¢, -o, are | and —3. Now the total spin is 
and its square is 
S:-S= 434 to, - ap. (7.28) 


Therefore 
S-S|l,m> = 2\1, m), S-S|0,0>5 = 0, 


so that (as we anticipated in the notation) the states |1, m) have spin I and 
|0,0> spin 0. The fact that the symmetric states have spin 1 and the anti- 
symmetric ones have spin 0 may also be seen directly from the relation 


S:S =p’? +1, (7.29) 
which is derivable from Eqs. (7.27) and (7.28). 


7.3 SPIN WAVE IN A LATTICE 


Consider a lattice, the ith member of which has spin o;. The interaction betwee" 
the spins is due to the exchange effect, and the Hamiltonian is given by 


H = De A, j6; = G ;. (7.30) 


J 
Consider the lattice to be cubic, and let each point on the lattice be denoted 
by the integer vector N = a(n,e, + nye, + n,e,), where a is the lattice spacing: 
Then 


H= BS Ayn’ Oy * On’ (7.31) 
N,N’ 
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It is clear that the interaction depends only on the distance N — N’ = M, 
and if in addition it is assumed that only nearest-neighbor interactions are 


jmportant, 


H= » A, (On * On+a0e, + ON’ On+ae, + IN’ ON +a0,): (7.32) 


Occasionally next-nearest-neighbor interactions are important. They may 
be even more important than the nearest-neighbor ones, but such a situation 
can be analyzed in the same way that we use here. 

We now return to Eq. (7.32) and consider the one-dimensional case. The 
extension to three dimensions will be immediate. We also are interested in the 
ferromagnetic case (A, < 0), and we can neglect boundary effects by con- 
sidering the system to be a closed loop. Let there be N lattice points; gy, = oy. 
Thus, 

H = -A¥a,° 6,4; (7.33) 
n 


where 4 = —A,. The minus sign in front of A is included to emphasize the 
fact that in the case of ferromagnets, —A < 0. That is, the energy is most 
negative when the spins are parallel. The (rare) instances when —A > 0 are 
due to a double-exchange effect. Recall that we used x to represent spin up and 
B spin down, and we defined the spin exchange operator p'* = 4(1 + 6, -o)), 
satisfying 

pi lax = lao, 

p'|BBY = |BB), 


7.34 
p'* |p» = |Ba>, ( ) 
p'*|Ba) = |xB). 
Then 
N N 
H = -A ¥ (2p™"*! — 1) = NA — 2A ¥, pot, (7.35) 
n=1 n=1 


To solve Schrédinger’s equation with the above Hamiltonian, we must find the 
eigenstates and eigenvalues of H. To do so, notice first that Eq. (7.34) implies 
that H operating on a state with a fixed number of f’s gives a state with the 
Same number of f’s. Since all states can be represented as linear combinations of 
those with a fixed number of f’s, we have a method of partially diagonalizing H. 

Rather than trying to completely diagonalize 7, let us merely find eigenstates 
with low energy. 

One suspects that the lowest state (most negative energy) occurs when all 
the spins are parallel. For example |xx--- > is such a state. More generally, 
when we say all spins are parallel what we really mean is that the total spin is as 
large in magnitude as possible, and it can be shown that the states that satisfy 
this condition are completely symmetric states. The completely symmetric 
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states are invariant under p""*'; so p""*?! can be replaced by | in computing the 


energy of those states. Thus, we expect that 


Egna = (completely symmetric spin state |H| completely symmetric state) 
N 
NA — 2A x! = —NA 


is the lowest-energy state. That this is the lowest energy is obvious. The 
eigenvalues of p"""*! are +1; thus the minimum conceivable energy can occur 
only when for all » we get the maximum possible expectation value for p""*!. 
that is, when we can replace p™"*! with +1, 


Problem: If —A > 0, what is the lowest state and what is the lowest energy? 


In order to take the ground state as the zero of the energy spectrum, define 
N 
H'=H + NA = —2A XY (put) re 1). 
n=1 


One may suspect that the next-to-lowest states has all but one spin parallel. 
But which one? It is clear that 


IWi> = [Bax---a), |2> = laBa---a>, 
lW,, = la-s:aBa--+a> (Bin nth place) 


are equivalent and that none of them can be an eigenfunction. We look for a 
superposition of the |y,). Let 


(7.36) 


> = Ya, W,>. (7.37) 
To solve the equation H’|~>) = Eli), 
> (Ea), = Ely) = H'lw) 
N N 
= -24 Ya, x (p™™"** — Ibn) 
N 
= —2A }) a[(p"** — 1) + Cp" — D]ly,> 


= 


ll 
Le 
oe " 


a aLlWn+1> + Wa-1> — 2lWn>]- 


Because the |» are sata we can equate coefficients of |w) to get 
Ea, = —2A(Qy41 + G1 — 2G). (7.38) 


To solve this set of equations, let a, = e!" and substitute into Eq. (7.38). Wé 
thus obtain 


E = —2A(e® — 2 + e”) = 4A(1 — cos 56) = 8A sin? :. (7.39) 
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(a) 


Fig. 7.1 Energy as a function of 6. 


Neglecting boundary conditions as we have done is tantamount to assuming 
that the N lattice points are arranged in a circle, or that the N + I point is 
the first point. In other words, 


Quay Say Or. US et 
This means that 6 = (2z/N)J/; / an integer; 

—-N/2 <1 < N/2. (7.40) 
We have, from Eqs. (7.39) and (7.40): 


2 2n1 


E = 8Asin - = 8A sin (7.41) 


This function is plotted in Fig. 7.1. There is, in other words, a band of energies, 
the minimum energy occurring at 6 = 0. If 6 = 0 we have a completely sym- 
metric state, which we have already concluded has the Jowest possible energy, 
E = 0. Another way of seeing that the state with 6 = 0 has the same energy as 
lax--- a is as follows. Let each spin be tipped from its “up” position by an 
angle 6, where @ is very small. Such a state is given by 


In = (cos 5a + sin 5 pcos 5a + sin 5B) ). 


Because 6/2 is small, cos @/2 x 1 and sin 6/2 x n « | (y = 0/2). Then 
Ix> = Ia + nB)a + nB---)). 
Neglecting terms in 7? or higher powers of y, we have 
[> = axe) + (Pace ae) + [xB ae) +o), 


Subtracting out the ground-state term, we obtain |7’) = >, n|W,)> where |w,) 
IS given by Eq. (7.36). Since eM must be an eigenvector of H with the same 
Cigenvalue (energy) as |x, 4,..., «,* so must |7’), and (1/n)|x’) = ¥, |W). 


* Since j in the absence of external interaction there could be no change in energy when 
t of the spins are rotated in a certain angle. 
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For 6 # 0, we see that the spins are out of phase. This suggests the concept 
of a wave, and we now rewrite e" to make the nature of the wave somewha, 
clearer. 

The one-dimensional integer “vector” is na, where a is the lattice spacing 
We can write a, as a, = e" = e*("), which describes a plane wave of Wave 
number k. Such waves are known as “‘spin waves.” Thus, 


Pure... eta. (7.42) 
a Na 
For small ka, which means long waves, cos ka = 1 — k?a?/2, and 
E = 2Aa’k? (long wavelength). (7.43) 


The extension to three dimensions gives us no trouble: The lowest state again 
has all spins parallel. Subtracting out the ground energy, we obtain 


H’ = — > Am(On* nim — 1) 
N,M 
and, in a manner similar to that used in the one-dimensional case, 
Wy = ¥ e&* N4Wy)> and E = ~ ¥ Ay(e'*'™ — 1). 
N M 
For nearest-neighbor interactions (in a cubic crystal), 


E = —2A{(e'®=* — 1) + (e7'®** — 1) + (e!®* — 1) 4 ++} 


= BA Jsin? Be? 4 gin? Ret 4. gin? Sel (7.44) 
2 2 2 
For long waves 
E = 2Aa’?K? = K?/2p, (7.45) 
where 
1/2u = 2Aa?. 


To find the next band of energies, we must consider the case when all the 
spins but two are parallel. But first we will digress to discuss the semiclassical 
interpretation of spin waves. Problems dealing only with spins have often an 
immediate semiclassical interpretation. Let us look at the spin wave case. 


7.4 SEMICLASSICAL INTERPRETATION OF SPIN WAVES 


The Hamiltonian is 


N N 
H = —2A yy (ph™tt— 1) =—-A Dy (o.*6,44-=-1): (7.46) 
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The Heisenberg equation of motion for the nth spin is 


— : (He, — ¢,H). (7.47) 


We transform this using the commutation relations: 
6,0, — 6,0, = 2id, etc. (7.48) 
so that 
(62° 04x — Fyx(62 °° G1) = (62.045 + F2yF1y + 52,04 ,)04x — O1x(°**) 
= 2i(6 1,02, oF 6 1.92y) 


2i(a, X o)x. (7.49) 


i 


Using Eq. (7.49) in Eq. (7.47), we can write 
he, = 2A(6, X On41 — G—1 X Gy): (7.50) 
We regard this as the classical equation of motion for the vector o,. This 


is a norilinear equation, but it can be linearized when we consider o,, to be 
close to 1, in which case we can approximate thus: 


ee = 2A29;, 5 — GOn+t,y — On-1,y) 
~ 
~ 


hay % —2A(26nx — Snsie ~ Fn—1,2)- (7.51) 


We expect the vector o,, to rotate around the z-axis, and we also expect the 
solution to be of the form 


Gnx ® ¢ sin wte™*, 
: (7.52) 
Tny & € cos wte™*, 
where a is the lattice constant. When we use Eq. (7.52) we see that 
ha = 4A(1 — cos ak). (7.53) 


7.5 TWO SPIN WAVES 


When the spins at positions 7, and 1, point downward we define the wave 
function Digs pecs 


lPn, np) = |e-+> xBa---aBa---a), (7.54) 
Where the B’s are in positions m, and ,. The Hamiltonian is the same as that of 
Eq. 7.46, and the Schrédinger equation is 


HW) = 44e? |W); Energy = 446 (7.55) 
Where 


hae eae Sea ee (a (7.56) 


ny<n2 
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When Eq. (7.56) is used in Eq. (7.55) and the coefficients of ¢,, ,, are compared 
we can distinguish two cases. When , ¥ 1, + 1, in other words when the 
two down spins at n, and 7, are nonadjacent, we have 
24, na = (Qn, no = Oy 5 55) + (Gn, np _ ae eee) 
+ (Qn, ny _ Qn, ,na- 1) + (Ags - Dyn +1) (7.57) 
On the other hand, when n, = n, + 1 the two down spins are next to each 
other and Eq. (7.57) does not hold. We then have 
264 n+ 1 = (Ann +1 = 7 Re eee p + (Quint 1 = Ann+2)= (7.58) 


Before solving Eqs. (7.57) and (7.58) rigorously let us derive some approx- 
imate results. Because the case n, = n, + | is only one out of N (the number 
of lattice points), for most of the time Eq. (7.57) holds. Therefore, the first 
crude approximation should correspond to two noninteracting waves: 
ee eikiniagikon2a (7.59) 


When we use Eq. (7.59) in Eq. (7.57), we see that 


Q, 


é?) = (1 — cos k,a) + (1 — cos kya) 
e(k,) + ek), (7.60) 
where 4Ae(k,) is the energy for one spin wave of wave number k,. 


At low temperatures, only the lowest energy modes are excited; thus for 
one spin wave, the energy might be, 


2 
a a4 
Qu 4Aa? 


and for two spin waves 


pak 4k 
Qu py 


We can form wave packets with group velocity @w/ék, where E = fiw and the 
phase velocity is w/k. The statements above bring out strikingly the similarity 
of spin waves to crystal vibrations or phonons. By analogy to the concepts of 
“phonon,” “photon,” and so on, we call a spin-wave excitation a “‘magnon.”’ 
Since the approximate system discussed above is made up of independent Bose 
particles, we can use the expressions that we discussed before. The free energy 
Fis 
3 
F = kT fre (1 — e BEM) a : (7.61) 
(2n)° 

For low temperatures only the E(k) near the bottom are significant, so that we 
may approximate 


E(k) = k?/2u, 
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Scattering 


Fig. 7.2 Two spin waves, or the scattering of two quasi-particles. 


and 
oC 2 
FakT| ini — emt Wek dk (7.62) 
0 (2n)° 
The energy is 
k? Qu 4nk? dk 
U = | oF /2ukeT (Qn)? . (7.63) 


This energy depends on T to the 5/2 power: 
UxeT. 

Therefore the specific heat is proportional to the 3/2 power of T: 
Car: 


This result for spin waves was derived by Bloch. 
Although the system is made up of ideal Bose particles, Bose-Einstein 
condensation does not occur because the number of particles is not fixed. 


Problem: When there is an external magnetic field, derive the energy of the 
spin waves. Find the magnetic susceptibility. 


76 TWO SPIN WAVES (RIGOROUS TREATMENT) 


Bethe treated the two spin waves in a linear system rigorously.* When there 
are two spin waves in a system, the problem can be looked upon as a scattering 
of two quasiparticles, as in Fig. 7.2. For the scattering process the energy and 
the momentum are conserved, and the eigenstate is a superposition of the 
incoming and outgoing waves: 

a = qeei(kinia + k2n2a) + Belthamiat kinza), (7.64) 


m1,M2 


__ From this point onward we will let a = 1. That is, for convenience we are 
either assuming unit lattice spacing or absorbing the factor a ink. As mentioned 
Carlier, n, < nz. The ratio «/f of the coefficients is related to the phase shift 


* See A. Sommerfeld and H. Bethe, Handbuch der Physik, Vol. XXIV/2, p. 604 (1933)- 
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resulting from the scattering, and it is to be determined from the scattering 


equation. 
The left-hand side, a,,,,, of Eq. (7.64), must satisfy Eqs. (7.57) and (7.58), 
First, when a, ,,, is substituted in Eq. (7.57), we see that 


e?) = (1 — cosk,) + (1 — cos ky). (7.65) 


ny ,R2 


This equation has the same form as Eq. (7.60), although we expect k,; now 
satisfies a different relation from that found in Section 7.1. 

Next we can use Eq. (7.64) in Eq. (7.58). Rather than doing that immediately, 
notice that Eq. (7.58) is a special case of Eq. (7.57) if we define a, , (which was 
left unspecified before) by 


24nn+1 = Quin + Antintt (7.66) 


Then, substituting Eq. (7.64) in Eq. (7.66) is equivalent to substituting Eq. (7.64) 
in Eq. (7.58). From Egs. (7.66) and (7.64) we have 


2(aei*? 4 Bei*) =a+ B + aeel(ki +k2) ~t Beilki tha) 


It follows that 


ot 1 + elk the) _ Qeiks 
B ee 4 gilkitka) _ Qeika 
— pilki~k2)/2 
_ _ 0s (k, + k,)/2 — e ee (7.61) 
cos (k, + k,)/2 — e7 We ~ #2 
Ifk, and k, are real, 
t= where =z = icos~1—1 M2 ~ jgitts—en0/2, 
Bp z* 2 
Therefore |«/$| = 1, and in fact 
a/p = e'*, (7.68) 
where /2 is the phase of z: 
cot 2? = REZ = sin (ky — k,)/2 
2 Imz_ cos(k, + k,)/2 — cos (k, — k,)/2 
ce wee aeot ; 
2 2 2 
or 
2 cot 2 = cot i, cot i ; (7.69) 
2 2 2 


We will also consider cases in which k, and k, are complex. Then |a/f| is n° 
longer 1, but g may still be defined by Eq. (7.68) (@ will now be complex)- 
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Equation (7.69) holds, for such cases, as an algebraic identity or by analytic 
continuation from the case of real g. 

For scattering problems we assume k, and k, to be real. Then [a/f{ = 1, 
as is expected, and g, which is real in this case, is the phase shift after the 
scattering. One special solution is 


— pig/2 
ne thy (7.70) 
B =e ip/2 
From the substitution of Eq. (7.70) in Eq. (7.64), this becomes 
a as gilkint +kan2to/2) + eilkami tkinz~@/2) (7.71) 


When we impose the periodicity requirement with the period of NV, we see that 


a (7.72) 


n2nytNn =a 


my M2" 


(Because 1, + N > ng, we do not write dy.4nny = ny,n2-) Equation (7.72) is 
satisfied when, from Eq. (7.71), 


ee = 21m, + @ 


(7.73) 
Nk, = 2am, — Q, 


where m, and m, are integers. It must be noticed here that Nk, is not equal to 

2nm,, as was the case for one spin wave, but is changed slightly by the amount 

of g. Using Eqs. (7.69) and (7.73) we can eliminate k, and k,, and determine 

as a function of m, and my. 

It can be shown that the real values of & do not exhaust the possibilities. 

We can examine the case of complex wave number by putting 
ee ae, (7.74) 
k, =u — iv. 

Equations (7.70) through (7.73) still hold (although g is complex). With 

Eqs. (7.74) we can rewrite Eq. (7.73) in terms of uv and v: 


= 
Il 


aa + m)), (7.75) 


gy = mm, — m,) + iNv. (7.76) 


From Eq. (7.75) it follows that w is real. Using Eqs. (7.74) in Eq. (7.67), and 
(7.76) in Eq. (7.68), we obtain a relation between w and v: 


Il 


: —v 
eit(m2—i) 9 ~ Nv _ f0s of =e. ° 


cos u — e” 


Now, if we hold v fixed and let N > oo, the left hand side — 0; so we have for 
large N 
e-° & cos yu, (7.77) 
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so that v is approximately real. (We have assumed that Re v > 0, but the 
following result will be the same if Re v < 0.) The energy is derived from 
Eqs. (7.65) and (7.74); use of Eq. (7.77) leads to 


a?) = 2 — cos (u + iv) — cos (u — iv) - 


2(1 — cos u cosh v) 
~ sin? u = $(1 — cosk), (7.78) 
where we have defined 
k= 2u =k, + ky. (7.79) 
To get a,, », We use Eqs. (7.74) and (7.76) in Eq. (7.71). The result is (with a 


change in the normalization) 


i N 
Gs, yet rt cosh 7G + ny — nz 


or (7.80) 


, , N 
fgg See ee sinh (3 + my — nz)’ 


according to whether m, + mz is even or odd. a,, ,,, has a sharp maximum when 
ny = ny (or n, = nz — N) and decreases as n, — n, increases (remember we 
assumed n, S np). 

Our solution represents a bound state. What is the binding energy for this 
quasiparticle? When there are two independent spin waves both having wave 
number K, the energy is 


é= 21 — cos K) = K? — K*/12 +-:- (7.81) 
For a quasiparticle of wave number 2K, we find from Eq. (7.78) the energy 
é = $(1 — cos 2K) = K* — K*/3 +-->. (7.82) 


Equation 7.82 gives a lower energy than Eq. (7.81) does, but the difference is 
very small, namely, K*/4. In other words the binding is very weak, and it 
depends on K in such a way that the long-wavelength spin waves do not interact 
much. This is the reason why the approximation of non-interacting spin 
waves is a good approximation for low temperatures. 


7.1 SCATTERING OF TWO SPIN WAVES 


In this section and the one that follows we will approach the two-spin-wave 
system in a slightly different way. In Section 7.6 we found the eigenstates of the 
Hamiltonian containing two spin waves. Here we will consider the simple waves 
found in Section 7.5, which are not eigenstates of H but describe pairs of 
independent spin waves. Then we will apply perturbation theory to calculate 
the amplitude for scattering of these waves. 
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The general Hamiltonian for spin waves in three dimensions is written as 


H== Y 24" ™ = 1; (7.83) 
M,N 


where Ay is a constant. For the one-dimensional, nearest-neighbor exchange 
interaction case, we may write 


H = —2AY(p"*! — 1), (7.84) 
N 


The scattering of two spin waves will be described in this section for the one- 
dimensional case, although a similar analysis holds for the case of three dimen- 
sions. 

We know that a ground state of Eq. (7.84) is 


lop) = lax---a), (7.85) 
for which all spins point up. When one of the spins points down, 
1 ikn 
We) = = > e* IQn>s (7.86) 


VN 


is an eigenfunction of Eq. (7.84) with the eigenvalue 


E, = 4A(1 — cos k). (7.87) 
When there are two spins pointing down, 
= 1 ikyny ik2n2 
Wied = Tae >» ane ica” ee (7.88) 


describes a state in which two spin waves exist. However, Eq. (7.88) is not an 
eigenfunction of Eq. (7.84), as we saw above, and the two spin waves scatter 
each other. Let us examine how the two spin waves interact. By definition, 
l@nin2> == |Pngn,> is the state where spins are down at 7, and 72, and |,,,,> = 0. 
We want to evaluate 


JN? H\Wrka> = by ee Oe 5. 


nina 
Here H|q,,,,> is calculated for four cases: 
i) n, and n, are separated, that ism, # m,n. 4m, + I,m, #n, — 1, 
li) ny =n, + 1. 
iii) ny =n, — 1. 
iv) n, = n,. In this case IPnim > = 0. 
For case (i), 
flO .4,) = —2AlN Ga aed — nid) F Ua, ig? = 1Pnind) 
AMOR wei) = Wei) Once" Wajes) 789) 


I 
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For case (ii), 


Al @nn+1> = —2A[(@n- 14041) a I@nn+1)) + (ln n+2> = IPnn+1>)]- 


In order to conform with Eq. (7.89) we will write the equation above in the form 


Al @nn+1> = =2A[ = 215.044) a (l@n—1 241) = lPan+1)) 

A Peee ap IPnn+1>)] a 4A Pnnsi)> (7.90) 
where the term in square brackets is Eq. (7.89) with n, = n, + 1. Similarly, 
for case (iii), 

FN @an-1) = —2A[—2l@un—1> + (IPnn-2> — IPna—1>) 
+ (Pn 1.n-12 ~ 1nn-12)] — 4Al@nn-1>- (7.91) 
For case (iv), 


Al@nn) = 9 
o ~2A[|@n+1.2> F IPn-149) + IPnnt1> + lOgyaad 
+4A[|Pn na i> P lPnn—1>]- (7.92) 


Here the first term is Eq. (7.89) with ny = ny. = 

When we use Eqs. (7.89) through (7.92) to evaluate VN? Hly,,,,), the 
first terms of Eqs. (7.90), (7.91), and (7.92) combined with Eq. (7.89) lead to 
the sum of Eq. (7.92) over n, and n, without restriction; thus 
VN? Wiss —2A >, eta ee - 1) + (ei — 1) 


nyn2 


i 


+(e ~ 1) +(e°™ ~ b] 


se 4A XY eer ONG: ara) + eT IN Gz 25 
N 


— Ce Ooaad + @uaoids 


4A[(1 — cos ky) + (1 — cos ky)IVN? Wi, + VN2 Ix). 
(7.93) 


The “remainder” |x) is written as 


VN? |x) = ~4A Y ert hams 4. eft — 1 — elatiadyig 15 (7.94) 


The first term of Eq. (7.93) is, as seen in Eq. (7.88), a double sum over #, 
whereas Eq. (7.94) is a single sum over n. Therefore, the second term of 
Eq. (7.93) can be regarded as expressing the scattering of the two spin waves 
that appear in the first term of Eq. (7.93). 

Notice that when H operates on |y,,,,> @ new state emerges, but the new 
state is still a combination of states with two flipped spins. In other words, 
when two spin waves encounter, two spin waves come out. 


71.8 Non-orthogonality 215 


With the help of Eq. (7.87) we may write Eq. (7.93) as 


Wises? = (E,, ot Ex) Wises? + Ix), (7.95) 
where 
4A 
|x) SS eh Tames (7.96) 
VN?" 
with 
E = ef 4 elt — 1 gli tha), (7.97) 


7.88 NON-ORTHOGONALITY 


‘The general theory of scattering tells us that the cross section o satisfies 


(number of final states) 


ov = a IM;\|? x 
h AE 


(7.98) 


where v is the velocity of the colliding particles. From perturbation theory, 


Mi Hep op rll ye (7.99) 
n#i E; — E, 
Hj, is the matrix element of the perturbation Hamiltonian between the initial 
and final states. In this section the discussion will be limited to the first term 
of Eq. (7.99). 

We are interested in the scattering from the initial state |y,, ,,) to the final 
state |W,, 1,> where 

E,, + &, = Er, + Et, (7.100) 


Then we cannot use Eq. (7.99) as such, because the states |W,,,,> are not 


orthogonal to each other. This is shown as follows. We compute the inner 
product, using Eq. (7.88): 


1 i(kin n2—Lyni- f 
» s et e =f P22 Dns! Onna? (7.101) 


Wr 1alWe > eerery 
1L2 15k2 N?2 ar ire 

From the definition of |¢,,,,>, we see that 
« = h — _ , ies les , 
PninslPninz> = L when ny = ny # nz = ny Or ny =n, AM, = 4, 


, 


= 0 otherwise, including the case nj = n, = ny = Ng 


(7.102) 
Thus, we can write 
= e. i[ (ki ~ £1)ny + (k2— L2)n2) i(k: — L2)ai + (k2-L1)n2) 
Wri tal Werks? 3 {e +e } 
nin2 
2 i(k, +k2—Ly—L2)n 
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In the first sum of Eq. (7.103), 7, and n, are summed without constraints, so 
that we see 


a a) eller —Lidmi + (ka—L2)ma] — | when k, = L, and k, = L, 
N?2 nynz 


= 0 otherwise. (7.104) 


If the second term of Eq. (7.103) were absent, that equation would show the 
orthogonality of W,,,,- If k,, k, are not equal to L,, L, but 


(eek) Hh £13 (7.105) 


where ‘‘0” denotes any integral multiple of 2x. Then, although the first sum of 
Eq. (7.103) vanishes, the last term contributes —2/N. Thus, |i,,,,> are not 
strictly orthogonal to each other. 
To calculate the scattering amplitude we consider the first term of Eq. (7.99), 
using for the perturbation Hamiltonian 
H’ = H — (E,, + &,,). (7.106) 
Therefore, 
My = (ra — (E;, + Ex) (Wires? = (WrytalX)s (7.107) 


where we used Eq. (7.95). Using Eqs. (7.96) and (7.88), we see that 


= -i 
M;; = >) yee Marae Cham tLe do. | Pa; ee, 


n myn2 
= nt ») eilki +ka- Iy- Eee te + ew hye (7.108) 


because 


CPningIPnnti> = 1 when ny =n, ng =n+1 or mn =nt il, n= 


= 0 otherwise. 
Further, in Eq. (7.108) 
yy fr the is~ tan = N when kk, + kz = Ly + L, + “0” 
= 0 otherwise. 
Thus, using Eq. (7.97), we have 
My = — “ (e7 Et 4 e Haye 4 eft — 1 — elite) (7,109) 


when k, + k, = L, + Lz + “0”. Because of this relation between k and L, 
and also the relation in Eq. (7.100), M,; is symmetric in k and L. 
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When we expand M,, for small k; we see that 
|M|? ~ (k, + k2)’. 


Therefore, unless k; is large, the scattering cross section is small. 


79 OPERATOR METHOD 


It is possible to rewrite the Hamiltonian for spin waves in terms of creation and 
annihilation operators. 
Using the spin operators o, and o,, we define o, and a_ by 


o, + ig, | dG, — id, 
Ge, Ag ee, 7.110 
n 5 5 ( ) 


Remembering that 


0 1 0 -i 
0, = ( | and oO, = i) (7.111) 


0 1 0 0 
6, = (; 7 and o_= i 1) (7.112) 


It is easy to show that 


we have 


o,0_ =43+ to,. (7.113) 


When |g» is the ‘“‘vacuum”’ state in which all spins are up, we can write 
the state |g,» in which the spin is down at N as 


ln) = Fy-|Go).- (7.114) 
The state |, of Eq. (7.86) (generalized to three dimensions) may be expressed as 
1 2 
Io) = (+ Tet “on-}ivo) = ai! |p0), (7.115) 
VNF 
where we define the creation operator of a magnon to be 
ike No 
ay x e : (7.116) 
a NR 
The destruction operator is then 
a= —= » eg RNG... (7.117) 


7 


Further, |W,, > of Eq. (7-88) is then 
Wists? = 44,4x,|P0)- (7.118) 
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To see that this is correct, remember that 
o_o. =0. 
Equation (7.117) may be solved for oy. as 


One = ety) sh Ae (7.119) 
VN® 


The Hamiltonian of Eq. (7.83) includes p¥**™, which contains oy . On, 4 
or terms of the form 


d102 + ayor + 6302, 
in which the first two terms can be rewritten in terms of products of two a’s, 
The last term corresponds to products of four a’s because of the result shown in 


Eq. (7.113). 
Now let us find out if the a’s satisfy commutation relations such as 


[a,, a6 |] = Sue. (7.120) 


The answer is no, and is related to the fact that the y,,,, are not orthogonal to 
each other. We can disprove Eq. (7.120) using Eqs. (7.116) and (7.117) together 
with the fact that [o,, ¢_] = a,: 


1 err 
[a,, ap] = N p> ML ati Ce ae 
1iN2 
1 —i(k-L)+N 
=—_— e o 2 
NF ie 
= Onn + a) e-E)N(gy, — 1), (7.121) 
N 
because 
1 eae 
ee he 
N 


On account of the second term in Eq. (7.121), Eq. (7.120) does not hold exactly. 
although the second term is numerically of the order of 1/N when there are only’ 
a few particles with spin down. 


7.10 SCATTERING OF SPIN WAVES—OSCILLATOR ANALOG 


Figure 7.3 shows two waves of momenta K, and K, interacting and “emerging 
with momenta L, and L,. The scattering matrix M,, is given by 


M,, = —2A¥ ef Md — e MR, — @iMe Ms), (7.122) 
M 
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L)=K,+Q 
Lo=K2—Q 


K, Ky 


Fig. 7.3 Scattering of two spin waves. 


For small K,, Ky, M;; ~ K,K, ~ K?. The scattering cross section a is given 
approximately by 
a ~ |M\? ~ K+. (7.123) 


At a temperature 7, AK?/2u = kT and K? = T. From Eggs. (7.122) and 
(7.123), ¢ ~ T?. For ordinary particles, M,;; ~ 1; hence o ~ I, and the virial 
coefficient b, ~ T'/?, In the case of spin waves, however, o ~ T? and thus 


b, ~ T?/?, (7.124) 


It is plausible to suppose that in a virial expansion we must first consider 
independent spin waves, for the next term, two-particle collisions, then three, 
and so on. Such plausibility arguments can be made rigorous by the use of an 
“harmonic oscillator analog.” 

In this analog, corresponding to the spin state where the ith spin is down we 
will take the oscillator state where the ith oscillator is excited to the first level 
and the others are not excited at all. Hence, 00100 corresponds to axfaa, and 
so on. Let ay be the creation operator for the oscillators. That is, «y creates a 
phonon (say) in the Nth oscillator. 


tS ae (7.125) 
N 


Now the real H (for spin waves) acts as follows: 


(aBane) 
H(aaBan) > (anaay (7.126) 
We want now to construct an analog H for the oscillators so that 
(01000) 
H(00100) > (00010) 
Now 
H= 3 [—2A(a;5, 0; — a*a,) — 2A(ao 2, — a> 0,)] 
=i > = Ay(Om+n — ay Oy = > ExaK ax, (7.127) 
N K 
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where 
&x = -y Ayien™ = 1) 
M 


will do the trick as long as only one B is present. However, consider the 
following 


(aBaBac) 
H(aaB Baa) > (ooBarBer) (7.128) 
However, the H defined by Eq. (7.127) acts differently: 
H(001100) — (010100) 
_ (001010) 
v2 (000200) < 
2 (002000) — 
Note that the arrowed functions in Eq. (7.129) have no analog in Eq. (7.128). 


Therefore we must add terms to H so that these arrowed terms will disappear: 
that is, we want to define a new H’ = H + A such that 


(010100) 
(001010) ° 


Hence the term A that is added to H behaves like 


—/2 (000200) 
—/2 (002000) ' 


(7.129) 


H’(00100) > 


A(001100) — 


Then 
A = PAL or ai in te — OG 1 Oe 1) (7.130) 


is the new term we need. For example, «;,,%; annihilates two adjacent phonons 
atiandi + I, and a; ,a;4,, creates two ati + 1. 

H' = H + A defined as above is not Hermitian, and terms such as 
or;*, ,0j° %0; must be added. H is an exact analog for the real H if states with only 
one f are considered; H’ is exact if states with one or two f’s are considered: 
and so forth. 

Note that the state (00200) may contribute an energy E, such that e **' is 
not negligible, in which case the e~ 4" term must be subtracted out when forming 
the partition function, e~ 4», 


Problem: Using «,’s and «{’s which satisfy anticommutation relations will solve 
the problem of states with more than one “particle.” What are the difficulties” 


CHAPTER 8 


POLARON PROBLEM 


8.1 INTRODUCTION 


An electron in an ionic crystal polarizes the lattice in its neighborhood. The 
interaction changes the energy of the electron, and furthermore, when the electron 
moves the polarization state must move with it. An electron moving with its 
accompanying distortion of the lattice has sometimes been called a polaron. 
It has an effective mass higher than that of the electron. We wish to compute 
the energy and effective mass of such an electron. 

Even without any vibration, there is a (periodic) potential acting on the 
electron. This potential will be approximated by assuming that its presence 
causes the electron to behave as a free particle but with an effective mass, M. 
Hence the energy of the electron when there are no lattice vibrations is P?/2M, 
where P is the electron’s momentum. 

It will be recalled that the dispersion in a crystal is as shown in Fig. 8.1. 

We will be interested in the region described by the almost constant part of 
the optical branch, that is, the branch for which the positive ions move in a 
direction opposite to that of the neighboring negative ions. 

If we consider the permittivity as a function of frequency (Fig. 8.2), the 
Portion of the curve that will interest us is that part where the lattice vibrations 
are important. 


w 


4 
PU Optical branch 


> K 


Fig. 8.1 Dispersion in a crystal. 
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— +— Lattice vibrations 


Fig. 8.2. Permittivity as a function of frequency. 


Let us take the crystal lattice as being a continuum with a unit cell at each 
point x. The lattice vibrations are quantized according to Section 6.10, where 
the transition from a discrete lattice to a continuous one is made according to 
Eq. (6.201) of that section. Before writing down the Hamiltonian let us consider 
which vibrational modes actually interact with the electron; the others may be 
ignored in this problem. 

Using the notation of Section 6.10, we assume that the electron’s potential 
due to an undisturbed lattice is either zero or else is taken into account by 
reducing the mass as already described. If the lattice is displaced, the potential 
AV;,(x) felt by the electron is due to the charge density p(x) resulting from the 
displacement, and this in turn is due to the polarization P(x): 


V? AV, (x) = ep(x) = —eV- P(x) (8.1) 


(where the electron has charge —e). Assume that the crystal lattice has a 
positive ion and a negative ion in each unit cell. There are then six modes of 
vibration for each wave number K. In three of the modes both ions move the 
same distance in the same direction; as K — 0 these modes approach rigid 
translations of the whole crystal and w(K) — 0. Such modes do not contribute 
much to the polarization P(x). In the other three modes the positive and negative 
ions move in opposite directions in each unit cell, producing a polarization that 
is proportional to the amplitude of the mode. For these modes the frequenc} 
approaches a nonzero value w as K > 0, and we shall make the approximation 
that w(K) = o for all K. That is, we assume that the frequency is independent 
of the wave number. 

Let a*(K, a) be the creation operator for phonons. By analogy with 
Eqs. (6.200) and (6.202) (in which A*(k, a) is the creation operator for phonons) 
we see that the polarization (which must be proportional to the displacement) !$ 
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of the form 


3 3 
Bosal ® 


~~ § [atk, ae* "en, + at (K, aje”'* *ek |] 
(2x)° a=} : 
where ’ is a real constant. The charge density is then 

p(x) = -V- P(x) 


3 3 
Pm ses ‘3 
(22)? a= 


[a(K, ajelX'*K - ex, — a*(K, aje'*'*K - ex |]. 
1 


To a very good approximation we have one ex, which is in the K direction 
(the longitudinal optical mode) and therefore two which are perpendicular to it 
(transversal optical modes). Actually these differ in frequency, and have a 
frequency ratio of (é/e,,)'/”, where & and e,, are the static and high frequency 
dielectric constants of the crystals. Only the longitudinal mode, the one with 
(ex,,|K) contributes to p(x); denote the creation operator for that mode simply 
by ag and ignore the other modes. Thus 


3 
px) = ia’ | os K[agelt’* — agen", (8.2) 


so that, from Eq. (8.1), the electron’s potential energy due to the lattice vibrations 

is 

ee 
(2x)? K 

fou d°K 1 GosiRts 


ial 20 ( Mm) lomsKie® axe*'*], (8.3) 


AV (x) = —iex [ape ase **] 


where « is a dimensionless constant. It can be shown that 


_1(4 _ 1) e (2Me\"? aa 
2\e, E/hwo\ fh 


where ¢, and ¢,, are again the static and high-frequency dielectric constants of the 
Crystal. In a typical case such as sodium chloride, « is about 5, and in general it 
Tuns from about | to 20. 

The Hamiltonian for the free electron and lattice is 


P? d?k 
Ho + Hose = 2 > + hw | — = aga, 8.5 
| pat | Gays ae (8.5) 


Where we are considering only one electron with momentum operator P, and 
We are ignoring the other lattice modes (which do not interact with the electron 
8Ccording to our assumptions). 
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To simplify slightly the calculations of the following sections we shall assume 
the units to be such that A = M = w = 1. Also we shall sometimes use the 
notation 


= d°K * 


pi Momies sts 353 és , : 
= Qn? > Oxk (2n)"6°(K — K’) 


y 

K 

With the assumptions made above (due to Froéhlich*), the problem j, 
reduced to that of finding the properties of the following Hamiltonian: 


H = $P? + Yagay + i(/2 an)!/? re [age ** — aye™®*], (8.6) 
K K 


Here, X is the vector position of the electron, P its conjugate momentum, and 
ax, dx are the creation and annihilation operators of a phonon of momentum K, 

The polaron problem is not of special importance, but we have now reduced 
it to mathematics. The method we shall use to solve this mathematical problem 
will be applicable to different problems of a similar nature. Our method will be 
valid for arbitrary coupling, but we will first assume small « and will use con- 
ventional perturbation theory to get, first, an answer that we can compare with 
our results for arbitrary coupling, and second, a description of what happens if 
the electron goes too fast. 


8.2 PERTURBATION TREATMENT OF THE POLARON PROBLEM 


We wish to find AE, the perturbation energy when there are originally no lattice 
vibrations. The Hamiltonian is H = H, + H’ where 


Pp? 
Hy — 2 + y 9K ax 
a 1 a - 
H' = i(/2 nay eK (age * — aye'®%), 
K 
‘i H’ 
AE, = Hio + Dag poe (8.7) 
2 oo” #n 


where Hj, = (m|H'\n), |m) and |n) are eigenstates of Hg and E® is the 
eigenvalue of Hg corresponding to |n). 

Because H’ acting on a state changes the number of phonons, we have 
Hb, = 0. We are interested in the energy of an electron traveling with the 
momentum P. Therefore, we consider the diagram in Fig. 8.3. Here the initial 


* H. Frohlich, Adv. in Physics, 3, 325 (1954). 
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P 


Fig. 8.3 Electron with momentum (P — K) and phonon with momentum K. 


state has an electron of momentum P and no phonons. In the intermediate state 
the electron’s momentum is (P — K) and a phonon of momentum K exists. 
Thus the energies E? and E° are 


- +1. (8.8) 
Here “1” in E° is the energy of the phonon (in our present units). 
~ 1 F , 
Hyo = i(V2 nx)!/?¢n| > K (axe '*** — aye'*’®)|P; no phonons), 
K 

which is 0 unless |n) represents a state with one phonon and an electron of 
Momentum P’. If the phonon has momentum K, 

Ayo = i(2 na)'/?< P’, no phonons |(1/K)e~'*"*|P, no phonons). (8.9) 
Since e~'*'X|p) = |P — K), P’ must be P — K, as expected, so 


Ho = (2 ma)'!?(1/K Sp: p— xs 
and 


c r 1 
SECS V2 na Ya op —K)y/2+1— P2/2 


x 2. (8.10) 


The extra factor of 2 is a consequence of the spin of the electron. There are two 
intermediate states with momentum P — K. Generalizing Eq. (8.10) to three 
dimensions and writing the sum as an integral, we find 


Q 2d3K/(2n)3 
AE, = =22 mx Rie a ae (8.11) 
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In evaluating this integral, we may make use of the identity 
_ dx 
9 [ex + bl — x)? 
Regarding b = K? anda = K* — 2P-K + 2, wecan write Eq. (8.11) as 


(8.12) 


AE, 


G d°?K 1 dx 
—4/2 an (2n)? |, [x(K? ~ 2P-K + 2) + (1 — x)K?} 


7 i d?K/j(2n)° 
—AnJ/2 a {, dx lw — 2xP-K+ 2x)? 


ae ie d>Kj(2n)° 
— 4a 2 1 |, dx le — xP)? + (2x — x?P?]° 


Now we use the integral: 


{ @K\Qn> 1 
[K? +a)?  gnvJa 


to find 
= t dx 
, my an 8nJ2x — x?P? 

sh 0? get P 
8x =P /2 

Thus, finally, 7 

AE, = ag? Sisto. (8.13) 

P V2 


When P = 0, Eq. (8.13) gives 
AE, = -«. (8.14) 
If the perturbation expansion is carried to higher order,* Eq. (8.14) becomes 
AE = —« + 1.26(a/10)?. 
This is the perturbation energy when the electron is at rest. When P is small. 
we may expand Eq. (8.13) in the form 


a “LB? aa) * i 


2 


ene see ee eee (8.15) 
12 


*E, Haga Progr. Theoret. Physics (Japan) 11, 449 (1954). 
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When we combine this perturbation with the kinetic energy, the total energy if 
2 2 2 
P Fas sah tedtess Ee oy 
2 12 2(1 + «/6) 


Equation (8.16) shows that the mass of the electron is increased (1 + «/6) 
times by the interaction with the phonons. That is, 


ater, (8.16) 


LE eweae (8.17) 


In Eq. (8.13), if P > J 2: AE; becomes imaginary. This implies that the 
energy of the electron dissipates by creating a phonon in the same sense as in 
Cerenkov radiation. Let us evaluate the smallest value of P sufficient for this 
decay effect. Obviously, the process that could give the smallest possible P is 
the one in which P is in the direction of K. Consider the case shown in Fig. 8.4. 
By conservation of energy, we know that 


(RT pe FO 


; 8.18 
2 2 oP 
Equation 8.18 can be rewritten as 
eee (8.19) 
2 «K 


The smallest P that can satisfy Eq. (8.19) corresponds to the minimum value 
of the left-hand side, which is 2, so that 


P>VJ2 (8.20) 


is the condition for the “Cerenkov” effect to occur, as we might have expected 
from Eq. (8.16). . 
Let us next calculate the rate of this Cerenkov effect. This rate is given as 
Rate = Y 2nx|H;ol7d(E; — E,). (8.21) 
1 


Fina 
states 


The initial state i consists of an electron of momentum P, and in the final state 
we have an electron of momentum (P — K) and a phonon of momentum K. 
From Eq. (8.6), we have 


, u om 
Ay, = CP’, UxlHinlP> = rack na)" Sp px, 


Fig. 8.4 Electron dissipates energy by creating a phonon. 
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where 1, means one phonon of momentum K. Thus 
~ _ K\2 2 3 
Rate = La oq (V2 (fP- AYP? fES >, 
t K? 2 2 / (2n) 
The 6-function insures conservation of energy. We may write 


a?K = 4nK? dK(dQ/4n) 


so that 


ee 2 
Rate = 95/2 a {2 [exs(-PK cos 0 + ~ + t), (8.23) 
74 


where 6@ is the angle between P and K. 
When K, is a solution of 
K2 
—PK, cos @ + ci +1=0, (8.24) 


we may transform 


[axe(—Px cos 0 + £ + ) = [ax 5[4((K — P cos 6)? — (Ky — Pcos 6)’)| 


~ [Ky — Pcos | 
Thus Eq. (8.23) becomes 
— dQ 
= as eee 8.24 
Rate = 4V 20 lege — P cos 6| oa 


When the momentum of the incoming electron P is given, K, of the created 
phonon is a function of the angle @, and is given by Eq. (8.24). Solving Eq. (8.24) 
we obtain 


7 Pcos@=V2 > 
Rate = Oe a | See 2= = 24 V2 cosh7! ci : (8.25) 
0 VP? cos? 6 — 2 P af2 


It should be noticed that since PIV 2> 1,we may write 


o: a P . “ 
sin~! = = = + icosh it 


io 2 (2. 


Thus, cosh~! P/,/2 in Eq. (8.25) is the imaginary part of sin7! P//2, a facto? 
we obtained in Eq. (8.13). The connection between Eqs. (8.13) and (8.25) can 
be seen as follows. When the rate of transition is y, the amplitude of remaining 
in the original state has a factor e ”/?, and the time-dependent wave functio! 
has a factor 


eo OP MteWiEt e HE iv/2)t, 
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» Fo 
n 


eH 


Fig. 8.5 The summand in Eq. (9.7) 


thus —7/2 is the imaginary part of the energy. Equation (8.25) agrees with 
Eg. (8.13) because both are for the limit of small x and both are based on a 
perturbation treatment. 

Although we have found Eq. (8.25) to be consistent with Eq. (8.13), you 
may object; Eq. (8.13) should be valid only when P < /2. The situation is 
clarified as follows. Strictly speaking, Eq. (8.7) does not hold when E° = £4, or 
when the sum over E° diverges near Ej. Schematically drawn, the summand in 
Eq. (8.7) has the shape shown in Fig. 8.5. The correct way to take care of the 
region near EQ is to write the perturbation as 


AonH no 
AEx = Yo 8.7’ 
7 d — Eo + ie ei 
and take the limit as ¢ > 0. 
We see 


1 x ig 


x+ie x? +467 x? 4 2? 


The first term has the shape shown in Fig. 8.6, and to use this term is equivalent 
to taking the principal value when integrating. The imaginary part approaches xz 
times a 5-function as ¢ > 0, because 


fds a, 
nie KO Ee 


Thus we may write 


lim —- | = principal value (‘) — in d(x). (8.26) 
¢ a x 
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I(x) 


x 
Fig. 8.6 The functi ae 
ig. 8.6 The function f(x) Pg? 


Oo 


@ 


Fig. 8.7 The integral i 


2 2 
-or +8 


Thus Eq. (8.7’) may be written as 


H,,H. 
AE = On*4 nO 
o= 2 gen pp 
principal 
part 
Equation (8.27) shows the consistency of Eqs. (8.7) and (8.21). Although 


Eq. (8.13) was originally calculated for P < J >. the analytic continuation 
tacitly made by raising the pole slightly. 


f 


Problem: For low-frequency sound waves 
@ = KC,, 
C, being the velocity of sound. Calculate the critical value of P for radiation of 


sound. Calculate the direction of the emerging sound waves as a function ° 
energy. Evaluate the rate of emission. 
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3] FORMULATION FOR THE VARIATIONAL TREATMENT 
phe partition function may be written as 


ee Tee = vert (8.28) 


When f > ©, the leading term is e **°. Therefore, we claim 


bow 


lim |- ; In (Tr ) = oe (8.29) 


Phus, we first calculate the partition function to evaluate E,,,,. We are partic- 
ylarly interested in the case of large f. 
Using the path-integral representation, the partition function is written as 


Tr (e 8") = fe R(path). (8.30) 


The path-integral representation is used when the Hamiltonian is written in 
terms of the coordinates and momenta. 

Since H is written-in terms of the creation and annihilation operators we 
must first undo it and write it using coordinates and momenta. If we quantize 
the motion of a crystal, we must choose the creation and annihilation operators 
so that 


qk — (a t a_ ) 
mi K K 


i 


Pe = —= (aix — ax). 
V2 
But then the interaction between the electron and the phonons is of the form 


— 1 ; ; 
Hin = (2 na)'!? ¥ KI fate =X — gyi '*] 


K 


RS 1 
= i(V2 na)? 2 IK] (atx — ag)e'*'* 
K 


V2 (V2 na)? by Le Dae": 
K |K| 
We do not find the above form for H,,, suitable, because in path integrals we 
Want potentials that are functions of position, rather than of momentum. 
fe Can exchange the roles of gx and p,x either before quantization of the crystal 
otion (by means of a canonical transformation) or afterward, as follows. 
Let ax = —ia_,. Then if we define gj and pj as in Eq. (8.31) we find 


Qk = DK; Pe = —4x- 
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The resulting Hamiltonian (dropping the primes) is 


ie + 


H =4P? + ¥ pz + 4q2 + V2 (V2 an)!” » (8.33) 
K 


Thus, the Hamiltonian is written in terms of the electron coordinate y 
momentum P, phonon coordinates gx and momenta pg. We write 


Tr (eH) = | e°® Ox(u) Bqy(u) Bqylu)--- (8.33) 


X(0)=X(B) 
9i(0) = Gil B) 


where the action integral S is 


S= (PS = > 3(gx(u) + gx(u)) + J2 (V2 an)'? se ax(uye® a ; 


(8.34) 


The advantage of this method is that the path integral over the phonon 
coordinates can be performed because gx and gx both appear quadratically 
(and linearly) in Eq. (8.34). We use the result proved previously*, and find 


B 
for \- | (4G? + wg?) + aGaQa] au| 2q\u) 
0 


br 
= exp ie | i y(t)y(s)e@"~S! at as} . (8.35) 
40 Jo Jo 


Since y and q are complex here, we modify Eq. (8.35) to read y*(t)y(s) instead 
of »(t)y(s). We will need this modification after Eq. (8.36). Also, Eq. (8.35) is 
approximate, for e~%° is neglected.+ When we use Eq. (8.35), Eq. (8.33) is 
written as 


Tr (e784) = [eq Dx(u) 


= fow(-} | Pe an) exp (23 Z| [ ih) 
o K Jo Jo 


x exp (iK - (X(t) — X(s))e"'"""! dt ds) BX(u) 


* Equation (3.39.) 

+ The complete form is written by replacing e~@!*—sl in Eq. (8.35) by 
eo olt-sl e@lt—sl ge 8 

1— eB YH eB 


But remember that £ is very large. 
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or, changing Sx into { sis 


3 iK -(X(t)—X{s)) 
‘ we di — \'2an le ell dr ds. 
2 (Qn) K2 


; x B —|r-s| 
S= abe du — gi dt ds. (8.36) 
2 V8 Jo oy — X(s)| 


Actually, we are in error when we treat all the g,’s as independent real variables 
as we did in Eq. (8.35) above. The gx’s are complex, with gf = ¢_x. But if 
we went to the trouble of doing it properly, Eq. (8.36) would still come out as 
it did above. 

Our aim in using this path-integral approach is to combine it with a 
variational theorem described before. We write 


e* A(path) = J eS ~%e"S° D(path) 
fe Se Dpath) 


Ss 


So 


e~ 5° B(path) 


== (e7 S~S0)s [a Q (path). (8.37) 


where S, is an appropriately chosen approximation of S. We use the inequality 
Let) > ev SI? 


to write Eq. (8.37) as 


| e-5 G(path) > e~ <S~S0> | e © G(path), (8.38) 


where 
[(S = Sp)e~5* G(path) 


ee aie eS (path) (8.39) 


Because for the approximate choice of the Hamiltonian the free energy Fo is 
given by 


e PFo = [> (path), 
We can write Eq. (8.38) as 
Pere 4 es (8.40) 
Before choosing Sy, let us examine the meaning of S in Eq. (8.36). The 


St term is analogous to a kinetic energy, and the second term represents the 
Potential energy. The special feature of this potential is that the potential at 1, 
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where t is thought of as a “‘time,’”’ depends on the past, with weight e~!"~*!, This 
is a type of retarded potential, and the significance of the interaction with the 
past is that the perturbation caused by the moving electron takes “‘time” to 
propagate in the crystal. 

When a is small, we may take only the first term of Eq. (8.36) as So: 


So =4 [x 2(u) du. 
Then, from, Eqs. (8.40) and (8.29), we obtain 


E < By + lim | <S ~ S,). 


Bo 0 
This gives the old perturbation answer 


E-—E,< -4. 


8.4 THE VARIATIONAL TREATMENT 


We have seen that the use of S, for the kinetic energy 4 [§ X7dt gives the per- 
turbation result AE S —a (second-order perturbation) and furthermore this 
result is shown to be an upper bound (a result proved only by much greater 
effort with the usual methods). 

However, it is possible to imitate the physical situation much better with a 
more judicious choice of Sy and thus obtain a much better estimate of E. The 
next most obvious thing to try for S, is the action for an electron bound in a 
classical potential V(X). This choice can be shown to be equivalent to the use of 
some trial wave function in the ordinary (Ritz) variational method. In particular, 
if one chooses V(X) to be a Coulomb potential, one obtains for EF (at large ~) 
the same result as that which follows from a trial wave function of the form 
e~*r_ If one chooses V(X) to be a harmonic potential, then one obtains an 
improved estimate for & that could also be obtained with a trial wave function 
of the form e~ **? (at least at large «). 

However, it can be shown that for a less than approximately 6, no V(X) can 
improve on the result V = 0! This indicates that a classical potential is not 4 
very good representation of the physical situation, except possibly at very large 
binding energies. There are two major reasons for this. 

First, the electron is not constrained to any particular part of the crystal but 
is free to wander. Any potential V(X) obviously tends to keep the electron neat 
its minimum, 

Second, we see from the form of the exact action that the potential the 
electron feels at any “‘time” depends on its position at previous times, with 4 
weighting factor e~'*~s!, That is, the effect the electron has on the crystal 
propagates at a finite velocity and can make itself felt on the electron at a late? 
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Electron Particle 
<—- { on 
Mass = 1 Force =- Kf M 


Fig. 8.8 An electron coupled by a “spring” to another particle of mass M. 


time. This is less so for tighter binding, where the reaction of the crystal occurs 
much faster; hence a classical potential might well be expected to be a good 
approximation only for tight binding. 

A model that would not suffer from either of the above objections would 
be one where, instead of the electron’s being coupled to the lattice, it is coupled 
by some “spring” to another particle and the pair of particles are free to wander. 
See, for example, Fig. 8.8. The trial action for the system after the coordinates 
of the mass M have been eliminated is 


Bie Cs 29~Wit-sI 
So cect Me dpa [Xu — X]%e dt ds, (8.41) 
2 Jo 2 Jo Jo 


where W = /K/M and C’ = MW?/4. 


Of course, we could have seen mathematically that Sy would overcome the 
objections raised above without deriving Sy from any physical model, because 
we are free to choose any form for the trial action that we please (if it is not 
Complex). However, the recognition of the physical nature of So greatly helps 
in avoiding mathematical difficulties when questions of boundary conditions, 
asymptotic behavior and so on, arise, and also may facilitate computation 
(see below). 

We write according to our variational principle 


E<E,+ ae iS: (8.42) 


where E, is the binding energy of our model and ¢ ) means ‘“‘average with weight 
e~ So» 


Let 
I(K,t,s) = (elk (Xt) X68) 
f eK KO-Xy-S0 DX 
theese 
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If we can evaluate J(K, t, s), then it is easy to find <S — Sy): 


Arh eo lal dt ds 
sog ye oa ena ) 
Z 8 ({, i Ix(t) — X(s)| 


C BB 
7S (| | {X(t) — X(s)Pe Wis! dt is). 
0 0 
For example, 


BB 
({ | |[X(t) — X(s)e7 VES! dt is) 


Il 


BB 
| | dt dse~¥-*I¢|X(t) — X(s)/?) 


BB 
| | dt dse~™ "I — V2I(K, 1,5) ]q =o. 
10) 0) 


In general, for any function f of |X(t) — X(s)|, we can evaluate ¢f(|X(t) ~ 
X(s)|)> by Fourier analyzing f. 


tat _ [Lane OTN) dbK\ _ [ d2K 4n I(K, t,s). 
[X(t) ~ X(s)| K? (2x)? (22)? K? 


To evaluate I(K, t, s) we will work with the more general 


I(g,t,s) = (ex : i, gu): X(u) iu) ; 
0 


and then will specialize to the case g(u) = K[d(u — t) — d(u — s)]. We may 
proceed in several ways to do this general path integral: 


a) If we divide “‘time” into small intervals, we have a problem of the form 
[ [fe dX, dX,°+-exp(—Y A,jX;X; + i )) B,X)). (8.43) 
ij i 


This integral has been developed* for A;; a positive definite and self- 
adjoint matrix. The result is that the expression 8.43 becomes: 


(n)""? (Det A)~"? exp [—4 }) B,B(A~’);)]. (8.44) 


t 
Here, # is the number of time intervals in B, and it may tend to infinity after 
dividing by the proper normalization to make the quantity approach 1 as 
B, > 0. With this method we must insert the end-point conditions X, = Xo = 9 
by any one of several artifices (such as an added term in the driving force B). 


b) Alternatively we may proceed by recognizing from the physical mode! 
that Sy can be broken up into “normal modes,” that is, a free particle with the 
total mass M + | plus a harmonic oscillator of frequency v = Jw? + 4cii 


* B. Friedman, ‘Principles and Techniques of Applied Math,’ Wiley, N.Y., 1964. 
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and reduced mass M/(1 + M). We have already done the path integrals for a 
driven free particle and a harmonic oscillator, and hence we may write the 
desired result immediately. 


c) Alternatively, we may proceed to perform the integral by a technique* 
(used in previous lectures for solving the simple harmonic oscillator) that 
consists of finding the path, X’(t), which makes the driven action an extremum 
and noticing that (within a normalizing factor) the desired result is 


I = exp E {. g(t): X(t) a] : 


We will use a fourth method. Let 


X(t) = X(0) + y a, sin a 


Note that X(t) satisfies the requirement that X(0) = X(f). Integrating over all 
paths is equivalent to integrating over all X(0) and all a,. Working in one 
dimension (for simplicity), we first must find Sp in terms of on and a,,. 


ae Game Oh dae ee nat 4, _ | ia n? 
cf sae a? ——. cos? — = ; 8.45 
i 2 2 | Z Bp? ri 4 zo ( 


{. {, (XQ) — X(s)Pe7 "5 dt ds 
0 0 


BBT @ 2 
= al | b a, (sn ant _ sin | eWlt-sl dt ds 
2 Jo Jo L»=1 B B 


C 9) 2 2] 
my a x & ard ae. (8.46) 


NIA 


Equation (8.46) holds only approximately because we have made certain 
approximations that are valid in the limit of high 8. Then 


B oo oo 
i g(u)X(u) du = X(0)bo + ¥ a,b, = Y a,b,, (8.47) 
0 n=1 =1 
where 
B 
b, = if g(u) sin du = iK (sin - sin) 
0 B B B 
And 
1(K, 1,5) = Ml2cexp [~Znei (Anan = Qnby)] day daz-+* — ¢g 4g) 


fie. exp — (See, A,a7) da, da,°-: 


*R. P. Feynman, Phys. Rev. 97, 660, (1955). 
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where 


2-2 
Aga Et aes, ACW ; 
4B W? + n?x?/p? 


From Eq. (8.48) it follows that 


00 b2 
(3. ai) 


: w —K(sin (nat/B) — sin (nns/B))? 
o (2, n?n?/B(L + (4C/W)(W? + n?x? ey): (8.49) 


I(K, t, s) 


As f approaches oo, the sum in Eq. (8.49) becomes an integral with 
nn/B — x, t/B = (x/B) dn - dx. This integral can be evaluated to give 
K? ta < 4C 


I(CK, t, s) = exp} — — t 1 — ev lt-sv 
( ) P| aE evel 


fier WOK ager arr rq | , (8.50) 


where 
V2 = W? + 4C/W. 


I(K, 1, s) is used only in integrals over s and ¢ with large B. In such integrals 
there is a contribution only when K?W?/2V2|t — s| is not much greater than 
unity, that is, in an area of order 2BV?/K?W? (in the plane formed by the s and f 
axes). In essentially all of the regions of integration, the fact that f is large 


implies that e “*"”, e~?”', and e~ 2” are very small, and we can write 
(Ohi) Sex| Se pega SO aie (8.51) 
ak: 2. V4 wy? ; 


From Eq. (8.51) we get for one dimension 


6B 
(| { (X(t) — X(s)Pe" WF dt as) 
8B 7 d* 
= -| | dt dse~¥\'-s| —— I(K, t, 8) 
eh dK 


For three dimensions we must triple the result (since V - K = 3 in three dimen- 
sions). We find that 


ey — X(s)2e7 Wiest \ = 3BC 8.52 
5 L(x - xeore men) = FE 8.52) 


2p 


~ —-~as fp Ow. 


K=0 
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We can now find the Ey we wanted in Eq. (8.42) as follows: e~#F% — 
f e °C YX implies that, for large B, 
1 d 


E((C) = FAC) = Ces o~ BFolC) 


= S (-3[ fe ds|X(t) — Kove) 


3 3 
VW w/w? + 4c/w- 


Integrating, and using the fact that E,(0) = 0, we have 
E,(C) = 3V — W). (8.53) 
To get (S — S,> we must evaluate 


I, i: (Re xD a ae ds = eS Be {, 1(K, t, s)e~"*~S! dt ds. 
X(t) — X(s)| (22)? K? Jo Jo 


Let I(K, t, s)e~""~s! = g(|t — s|). Here we are assuming Eq. (8.51) to hold 
because f is large. Then it is easy to show that 


6 fb B 
| | g(|t — s|) dt ds = 2| (B — u)g(u) du. 
0 0 0 


Since g(u) ~ 0 for large u, Bg(u) > ug(u) whenever there is a contribution to the 
integral; so we can neglect ug(u). We can also let the upper limit of the integral 
be oo, Then 


OR reese as ) aed 
KO — X6) 


d°K 2(4n) (? _fK7] Wu 4C ae: 
(2n)> K? Jy du exp| eke * wr : r}+ at 


_ 4BV [” ew" du 


(8.54) 
VanJo JW2u + [(V? — W/V] — e7%”) 
Using Eqs. (8.52), (8.53), and (8.54), we arrive at the final result: 
V due“ 
AY Wit Ae erik sas z 
pce ae Ne odes I, [Wu + (V7 — wyV)a — em)? 


(8.55) 
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We may now vary V and W to obtain the lowest upper bound. (These results 
are the lowest ever obtained as upper bounds for this problem.) 


For small «. The best W = 3 and V = 3(1 + 2a(1 — P)/3W), where 


P= [a — w)'? - 1]. 


From this, we obtain 
E << -a — a7/81 = —a — 1.23(a/10)? (8.56) 
The correct result from perturbation theory to this order is 
E = —a — 1,26(a/10)?. 
For large «. The best W = | and V = (4a7/9x) — (4(In 2 + C/2) — 1), 
C = Euler Mascheroni constant = 0.5772,..., and 
E < ~—a?/2n — 3(2In2 + C) — 3 + O(1/a?)--- (8.57) 
which, up to « = 10%, is lower than any other estimate, upper bound or 
otherwise. 


For intermediate « A very great advantage of the method here is that it 
gives a smooth result for E from weak through intermediate through strong 
coupling, and it gives the only known reliable results for this problem in the 
intermediate coupling region. Unfortunately, a little machine calculation is 
required here to perform the integration in 1/B¢S). In case you would like to 
compare some other calculation, we give in Table 8.1 a few results obtained by 
Schultz* using Eq. (8.55) and a computer. 


Table 8.1 
a= 3 5 7 9 11 


Upper bound of E: — 3.1333 — 5.4401 — 8.1127 — 11.986 — 15.710 


When using an approximate method, we like to have some idea of the size 
of the error. Our approximation was to assume that the inequality 


Ke F) > e Sf> 


represents an equality. To check the error involved in this assumption, proceed 
as follows: 


Nya (ase eat KR 


ie Cle) ta Da beak 


* T. D. Schultz, ‘“‘Electron-Lattice Interactions in Polar Crystals,” Thesis, MIT, 1956. 
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For small f, a good approximation is to neglect all but the e~‘/? term on the 
right. But a better approximation (for small f) is to take 


fet) = exp[—<f> + Hf?) — <f)*)] + higher-order terms. 


Taking e &(F-Fo) = ¢e~ S50), we get 


ee ee 4 = S502 [Us = SyPSp = CS = 9492] 


1 
2p [ 
+ higher-order terms in (S — So). 


If S — So is not small, the correction term, 


1 2 
og AP ol ERS SoM, 


may not be an improvement, but it should be an order-of-magnitude estimate 
of the size of the error in the uncorrected estimate, F ~ Fo + (1/B)(S — So). 
Such an order-of-magnitude estimate of the error in the polaron energy has 
been carried out, and it turns out to be small. 


8.5 EFFECTIVE MASS 


A variational principle along the above lines has not been found for the calcula- 
tion of the effective mass of the polaron because we need paths e~* where S’ 
is complex. In fact, for the lowest energy of an electron with velocity U we 
need to path integrate using 


5)8 dt — 273g {[™ — X, + 2iU-(t — s)|"'e7"""5! dt ds 


S(X) 


SCX, + iUt), 


as we would expect in ““imaginary-time” space. However, if U were imaginary, 
then we would again have a real S’ and could use the variational method. Now 
One can show that E(|U|), considered as function of a complex variable, is 
analytic near |U| = 0. Hence Fin the neighborhood of |U| = Ocan be obtained 
by knowing E for imaginary |U|. Therefore, one can deduce that for small 
velocities the effective mass is given by 


oe) 2 =t 
m=1+ pear? | ame (8.58) 


o [W2t + (V2 — WV) — ep?’ 
where V and W are the best parameters found for E(U = 0). It is interesting to 
note that Eq. (8.58) gives a value for m that is always within a few percent of the 
total mass, 1 + M = V?/W?, of the trial model.* 


* For more details, see R. P. Feynman, Phys. Rev. 97, 660 (1955). 


CHAPTER 9 


ELECTRON GAS IN A METAL 


9.1 INTRODUCTION: THE STATE FUNCTION 


The actual behavior of a metal is very complicated: The metal electrons interact 
with the lattice, with the lattice vibrations, and with one another. As a first 
approximation, we will assume the lattice to be rigid and will neglect the mutual] 
electrostatic and magnetic interactions, except insofar as each electron is 
affected by some average potential due to the other electrons and the periodic 
lattice. 

Hence, the Hamiltonian is given by 


hv? 
H=hyt+hyt+es+3 hy = —- —+ + V(R)) (9.1) 
2m 
and the energies given by 
hu, = eu, E= y ne, and Yin =N. (9.2) 


i i 


From the Pauli exclusion principle, 1; = 0 or 1, and the wave function of 
the entire electron gas must be antisymmetrical. At moderate temperatures, the 
electron gas is almost in its ground state. That is, the N electrons occupy the 
lowest N states—almost. For a gas with all spins in one direction (for example, 
all up or all down), the ground-state wave function is 


g = LY (-1)Pu,(PR)u,(PR,) (9.3) 
VN1? 
Xp Means a sum over all permutations, an even permutation having weight | 
and an odd permutation, weight —1. PR,, which might better have been written 
Rp;, means a permutation of the electrons. The normalization factor 1/V N! 
will be derived later. 
With these definitions of }, and PR,, it is seen that Eq. (9.3) can be 
written as 


(9.4) 


a) 


ad ™ | ae ee ia 
= oa x Det [u(R,)] = ——= ua 2) u(R2) | 


Ini 
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Suppose that V(R) = 0. Then (reviewing results obtained in Chapter 1), we 
have 


272 iK-R 
jee” aa: RSE (9.5) 
2m VV 
The energy is* 
h?K? h?K? d3?K 
a OI aELNG 1G 6 9.6 
» » 2m >{ 2m (2n)3 a) es) 


V = volume and F(K) = 1 if |K| < Ko, F(K) = Oif |K| > Ko, where fiK, is 
the maximum momentum, that is, Ey) = 4?K2/2m = Fermi level. 


Ko B22 2 2 
py RS ae UN he 
o 602m (2n)3 5 2m 2n? 
Ko 4 K2 (9.7) 
Nev de eRe 
o (2n)° (2x) 
Thus, if pp) = N/V ande = E/V, 
3 WK2 1 # ave 
e=t = — ——~ (6n7)9/3 98/3 = ape!3, 9.8 
57° on, 5 dma? Y Po Po (9.8) 
It is sometimes convenient to define rg and r, so that 
1/p9 = volume per electron = 42/3 r3. 
= T/A 
where 
dy = h?/Zme? = Bohr Radius. (9.9) 


r, is of course dimensionless, and for most metals, r, is between 2 and 6. The 
energy is often expressed in Rydbergs where 1 Rydberg = me*/2A?. 

The results of Eqs. (9.7) and (9.8), plus, of course, many more, can be ob- 
tained from ¢ of Eq. (9.3). First the normalization factor appearing in Eq. (9.3) 
will be derived.t| We require [ y*p d*R, d°R,:-:d°R, = 1. Consider a 
specific term in the summation of Eq. (9.3), without any normalization factor. 


Such a term is, for example, v,(R)u2(R,)-+-. Because the u; are orthonormal, 
ut(R,)u3(R2) * ++ uy(R2)u2(R,) +++ d°R, d°R, +++ = 0. 
Also, 
[ee oute) ++ 4,(Ry)u,(Rz) °°: d?R, d*? R,+-+ = 1. 


* We assume periodic boundary conditions. 


t The factor 1/N! did not appear in Eq. (6.125) because th enormalization there was 
I/N! [o* 9 d3R,---d3R, = 1. 
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In other words, 
| om@uatR -+- d3R, d?R,-+- = 1, ete. 


There are N! terms of the type u,(R,)u.(R,):-- and thus f p*g a?%R = N! 
for an unnormalized @. When u,(R) is normalized, the normalization factor is 
therefore VN. 
The kinetic energy T can be calculated in the same manner as the 
normalization integral. 
hv? hv? 
T=(9D- —@)=—- | o* Yate aR. (9.10) 
i 2m 7 2m 
Consider a single term of ~, which is now assumed to be normalized. Such a 


term is 1/\N! u,(R>)u>(R,)--+ and (dropping the 1//N}), 
—HWV7/2mu,(Ry)u(Ry) +++ = uy(Rz)u3(R3) ° + -(—#°V2/2mu,(R,)). 
Although it is not a priori evident that 
(—#?V7{/2m)u,(Ry) = 22u2(R;), 


where 2, is some eigenvalue, this must be so since electron number one has 
nowhere except u, to go. That is, V7 operates only on u,(R,) and does not 
bother other electrons. The argument used to evaluate the normalization 
integral now may be used, and since 


Since V = 0,* the kinetic energy T equals the total energy E and the u; are 
plane waves. Hence 2, = h?K?/2m and 


E_ me. (Ee aK, WK? 


FK = ~——* 9.11) 
vf lm am ome ~ 3°? “an 
Equation (9.11) is the same as Eq. (9.8). 

The energy per electron = 3(#7/2m)Kj = 2.22/r2 rydbergs, where Eq. (9.9) 
has been used. 


9.2 SOUND WAVES 


It is interesting to ask what the velocity of sound cg, is in an electron gas. It 
can be found by introducing a perturbation of density p = pay, + (dp) cos kx 
and determining how it propagates, or more directly by calculating c2 = 6P/ép 


* Try not to confuse V = volume with V = potential. 
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where P is given in Eq. (1.53) and p = the mass density (not the number density 
Po). Then 


Co = f@ gs SG a, SEO, (9.12) 
6p J/3m V3m 
The velocity of an excitation 
1 Ge _hKy _ Po 
4 héKy m m 


5 VS te (9.13) 


where P, is the Fermi momentum (not the pressure). 

Equation 9.12 is a general result; if the velocity of a gas particle is a constant, 
C, the velocity of sound is Ch 3. For, suppose P is the pressure of the gas, 
g its internal energy density and p its internal momentum density. Then, to 
first order in the sound perturbation, we have, using standard hydrodynamics, 


p= —-VP 


é = —C?V°p (energy conservation). 
But for constant velocity we also have P = $e. Combining these equations we 


obtain 


2 
fa Ss = 0; 
3 


which represents a wave propagating with velocity C3. 

The perhaps startling conclusion may be drawn that sound propagation in 
the collisionless electron gas is impossible. In other words, the energy of a den- 
sity disturbance will be dissipated very quickly. This may be seen as follows: 
Consider a Fermi sphere (Fig. 9.1). 


Fermi surface 


Fig. 9.1 A Fermi sphere. 
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A given excitation of momentum K can increase the total energy of the gas 
by as small an amount as we please (for instance when an electron is excite 
from A to B tangentially to the Fermi surface). Thus a great many energy leve|s 
of excitation exist for a given momentum increase. These energy levels are 
below the energy of a sound wave and hence the sound-wave energy will be 
quickly dissipated. 


9.3 CALCULATION OF P(R) 
We now remove the restriction V = 0. Let 
V= pS V(R;, R;). 
ij 


The reader can easily verify that 
(9.2 vce Rio) = F | tute yess) ~ wr WFR) 
ij ij 


x V(R,, R2)[u(R,)u(R2)] d?R, d?R3. 
(9.14) 


What is the probability, P(R), of two electrons (of same spin) being a given 
distance R apart? 


Pla, b) = (9, YH, ~ «) 4; - by) 


is the probability that there is a particular at a and one at b. This is of the same 
form as Eq. (9.14), so 


Pla, b) = ¥) {lu(@)l?lu(b))? — [uM@u@][ufGu(b)]}. 9.14) 


Take V = 0. Then ux ~ e€'4/\/V, and Y; > § d?KV/(2n)°, so that 


Ef (ORML gent ecu FOORD 

P(a, b) = {| (on) [i -—e 1 N2 
ae iK-(a-») F(K) d°K|’ (9.15) 
“Pelle N (np ' 


P(R) = [are d>bP(a, b) (a — b — R) 


4nR? 3 : 2 
= -— )1 —| ——. (sin KoR — KgRcos KoR (9.46) 
rt — | gape (i KoR ~ Kok cos Kor) || 
Equation 9.16 is an important result. For example, it tells us that the hard-sphere 
model of a one-spin gas should be easy to formulate, since under most conditions 


the electrons are not very close anyway (P(R) = Oat R = 0). The short-range 
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P(R)/R? 


ie 


oO 


Fig. 9.2 Probability P(R) of two electrons (of same spin) being some distance R apart. 


Coulomb interactions are now understandably not important—although the 
long-range ones are. It is also clear that Eq. (9.14) could have been written as 


(+ LVR, je) = | V (a, b)P(a, b) da d*b. (9.17) 


The mean potential energy is the integral over a and b of the product of the 
potential energy between two particles at a and b and the probability of two 
particles being at a and b. 

Suppose that one wants to know the number z of electrons that are in some 
region R. 


n= 2 R(R)) = >, [oe — a)R(a) d’a = ox d3a 


where 
1 inside R 


R = 
” ‘0 outside R, 


and 


pa) = ¥ &(R; — a) 
is the density of electrons at point a. 
Fluctuations can also be handled. For example, using the definition of 
P(a, b), we have 
(9, n°9) = G ¥ [ 50k, — a) 5(R, — 6) R@R() d?a ‘*b0) 
ij 
= [RoRwre, b) d?a d*b. (9.18) 


Pe = [erm dia 
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or 
px = Dek ® = POR, — aX" dea. (9.19) 


Clearly (px) = 0 for K # 0. But by squaring Eq. (9.19), we derive 
(\p3l) = (@. lpxl?9) = [erent b) d?a db = VS(K), 


where S(K) = | e'* ®P(R, 0) d°R = Fourier transform of P(R,0). From 
Eq. (9.15) we see 


S(K) = LAs me F(L)F(K — Hess K #0. (9.20) 


Recalling that 
F(K ~ L) = 1 if |\K — L| < Ko, 
0 if |K — L| > Ko, 


we can show that S(K) is the shaded volume in Figure 9.3. Calculating this 
volume, we obtain 


3 3 
Se ON at ee for K<2Ky, K #0, 
4nK3 4K,  16\Ko 


S(K) = 0,  K > 2Ko. 


Problem: Calculate <|px|?) for sound waves. Are the fluctuations here 
greater or less than the fluctuations of the real electron gas? 


(9.21) 


9.4 CORRELATION ENERGY 


Consider an electron gas with the background of uniform positive charge. The 
determinantal wave function ¢ for electrons (Eq. (9.4)) leads to the exclusion- 
volume radial distribution function P(R) plotted in Fig. 9.2. The Coulomb 
energy is then calculated as 


e _ e oe 4ne* 3 
(25 @) fq rman (x 8K) (9.22) 


Fig. 9.3. Intersection of two spheres, radius Ko, centers a distance K apart. 
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where S(K) is the Fourier transform of P(R). Using the form of S(K) 
in Eq. (9.21), we calculate Eq. (9.22) as 


2 
(» y- °) = — 0716 iv abergs. (9.23) 
i r 


iy Vij ¢ 


Here r, is defined in Eq. (9.9) as r, = ro/ay with ag = the Bohr radius and 
(4x/3)r¢ = the volume per electron. 
The total energy of this system is usually written as 


E = 2.22/r2 — 0.916/r, + ¢, rydbergs. (9.24) 


The first term is the kinetic energy calculated in Section 9.1. ¢, is called the 
correlation energy, but actually it is the difference between the true energy E 
and the first two terms on the right-hand side of Eq. (9.24). ¢, is also sometimes 
called the stupidity energy E,. 

There are attempts to compute ¢.. When the electron density is low and r, 
is large, we see from Eq. (9.24) that the kinetic energy is negligible compared to 
the potential-energy part. When we distribute the electrons in a regular array, 
the body-centered cubic structure gives the lowest potential energy, which is 

Exo = — se rydbergs for large r,. (9.25) 


pot 
rs 


Comparing Eqs. (9.24) and (9.25) we see 


Ly large r,. (9.26) 


Cc 


s 


Wigner examined the case of small r, also, and his best guess is* 


& = is rydbergs. (9.27) 
r, + 7.8 

When Eq. (9.26) was calculated, electrons were put on a body-centered 
cubic array with the wave function taken to be a 6-function. We may get a 
better estimate of energy if we use a different wave function. A calculation was 
done assigning a Gaussian distribution for an electron around a point of the 
body-centered cubic lattice and varying the shape of the Gaussian distribution. 
A reasonable numerical result was obtained. 


9.5 PLASMA OSCILLATION 


In a conductor, electrical resistance increases when impurity ions are introduced, 
the extra resistivity being due to the scattering of electrons by the extra positive 
charge. However, the resistivity calculated from the scattering is much larger 


* See Wigner, E. P., Phys. Rev. 46, 1002 (1934); Trans. Far. Soc. 34, 678 (1938). 
Also see, for correction, footnote in Pines, D., Solid State Physics, 1, 375 (1955). 
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than the observed value. This indicates that the extra positive charge is partly 
neutralized by the higher density of electrons gathered around the positive 
charge. In the Fermi-Thomas model, for example, the variation of the electron 
density is related to the (slowly varying) potential g by 

én _306Ep _ 3e9 


a PB OES 


where E, is the Fermi energy. Thus 


2 
div D = div E — 4ne6n = hg _ 6nne | iis 
Ey 


so that the dielectric constant is 


2 2 
e(k) = ke + vane [Ep 
Obviously, this means that the field of a point charge decays to zero over a 
distance of (E,/6nne*)'/? (the Fermi-Thomas wavelength), and so does én in the 
neighborhood of the point charge. The point charge is therefore screened (if 
positive) by the electrons, which form a small cluster around it. 

The above applies also to each one of the individual electrons in the con- 
ductor, én being now negative, so that each electron creates to himself a ‘‘hole”’ 
in which it moves. This is beside the effect of Fig. 9.2. The same kind of 
fluctuation of electron density causes the plasma oscillation. It can be calculated 
as follows: 


Let us consider a one-dimensional sinusoidal fluctuation in the (number) 
density of electrons: 
P = Pay + (dp) cos kx. (9.28) 


The potential energy arising from the change of local density is written as 
1 ( de 
E, = = | — (6p) cos? kx dx, 9.29) 
15 | ap? (5p) ( 
where ¢ is the energy of the Fermi sphere, which depends on the density, From 
the theory of sound we know that the velocity of a sound wave cg is written as 
07e/dp? = (m/pyv)Co, (9.30) 


where m is the mass of an electron. For the plasma of charged particles, we have 
another contribution to the potential energy, namely the electrostatic potential. 
It is calculated from the Poisson equation 


—VW?2V = —4mne(dp) cos kx, (9.31) 
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whose solution is 


V = —4Ane(5p) ae (9.32) 


The electrostatic potential energy is then 


ee [cw d(vol) 


2 
eh BEER eae eae: (9.33) 
23) Re 
Adding Eqs. (9.29) and (9.33), we find the total potential energy to be 
2 
Bee Se tee SE \ Gay coat kx a (9.34) 
24 \Pav k? 
This quantity is equal to the kinetic energy of the wave in Eq. (9.28): 
2: 
Ki | eG cost ade (9.35) 
2 J Pay k? 


where q is the angular frequency in the wave: 
“u= Aeilont kx). 
Equating Eqs. (9.34) and (9.35) we may write 
c? = c2 + 4ne*no/mk?, (9.36) 
where we have put 
Cy = O,/x, 

and ny is used for p,,. For the second term of Eq. (9.36) we introduce wp, the 
plasma frequency, as 

@p = (4ne?no/m)!!?. 
Then Eq. (9.36) may be written as 

wy, = Vor + c2k?. (9.37) 


The minimum energy required for exciting the plasma oscillation is ho,, 
which is about 10 to 15 eV. It has been observed by sending a beam of electrons 
through thin films of aluminum or beryllium. 

One might wonder how this semi-classical treatment fits into the quantum 
mechanical picture of the degenerate electron gas. To see this we note that the 
simplest way of creating an excitation of wave vector g from the Fermi sea 
is by the operator aj, ,@, with a certain allowed k, such that k + g is outside 
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the Fermi sphere whereas k is inside. Since we see no reason to restrict ourselves 
to a special k, we form 


O = VY f®ahs x 
The average energy of the state O|0) is 


E= X (0 + Ok, DISA + » (va) — Vk — k)) SSR) 


(ae -*44£) 


2m 


and let us try to minimize it with the normalization condition 


¥ lf @P S11: 


allowed 
k 


Obviously, this amounts to a diagonalization of a matrix A (allowed k’s only), 
Agyr = (& — 2 + Ok, 9) Onur + VQ) — Vk KF’). 


For large q, 0(k, g) seems to control the result so that the favorable excitations are 
electron-hole pairs, f(k) = 6,,,. But when q is small enough, the long range of 
the Coulomb potential suggests that V(q) controls the result, so that the min- 
imizing f(k) is f(k) ~ const. Then the favorable O is 


0) ms >» Aedes 
k 


which is exactly the operator for a density fluctuation with wavenumber gq. 
We therefore conclude, by this qualitative argument, that for small gq, 
density fluctuations should play an important role. 
Let us now see how we can make the microscopic picture more exact. 


9.6 RANDOM PHASE APPROXIMATION 
Bohm and Pines discussed the plasma oscillation in the following way. The 
Hamiltonian is 
_ #2 2 
Poy at yeas (9.38) 
7 2m 2 iF 5 Tj; 


The potential energy is Fourier transformed and written as 


—h? 1 4ne? ditt =, 
H=Y- VP + --¥ 5 sai KRIS _ NI}, (9.39) 
Da, , Fe Pe dhe | ( 


(Subtracting N eliminates the self-energy term with i = /.) 
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We will proceed to write the equation of motion of density fluctuations. 
The Fourier components of the density operator 


p(x) = ‘y o(R; — x) (9.40) 
are 
Pe = Dew®, (9.41) 
Let us examine the Heisenberg equation of motion of px. 
The equation of motion of px is calculated as follows 


Ke(P, + K/2) 
m 


Px = (Apx — pKH) = iy ene (9.42) 


where P, is the momentum operator of the /th electron. In deriving Eq. (9.42), 
we used the fact that px of Eq. (9.41) commutes with the potential energy of 
Eq. (9.39), and also we used the relation 


PKR — oiK-Rip Kei Ri (9.43) 


(remember that in our units A = 1). In evaluating 6x, we note that Eq. (9.42) 
no longer commutes with the potential part of Eq. (9.39), so that we obtain 


+ ix-r, LK: (P, + K/2))? i 4ne? ix. my)-ik’- Ry IK’ -R ' 
pee a, ae ne ik RIK. K’, 
i m V i mK 
(9.44) 


Now we make a crude approximation: 


yf R [K -(P, + K/2))? re K*p; Yi elk Ri = K? py 
i m 


3m 7 3m 
where p, is the momentum at the Fermi surface. The “3” in the denominator 
comes from averaging over three directions. 
For the second term of Eq. (9.44), we separate the K’ = K term from 
K’' # K. The former gives 


Pr, (9.45) 


2 2 

4ne* N yi eik-R — 4ne*n ie (9.46) 
m VG m 

where n = N/V is the number of electrons per unit volume. For K # K’, 
we may write 

2 
ARE K- Ke RS gMK- RR (9.47) 

t 


- 
Kj mK’ 
K'#K 


Bohm and Pines showed that this term is small for the relatively high-density 
case. When we neglect this term we call the result the random phase approxima- 
tion. This approximation is interpreted physically as the fact that the sum >, is 
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small if R, are distributed over a wide variety of positions, so that the various 
components making up the term tend to cancel. It should be emphasized that 
Bohm and Pines proved this approximation rather than simply assuming it. 
After these considerations, Eq. (9.44) simplifies to 
bx = —®xpx (9.48) 
with ; 
OK = p7K?/3m? + 4ne?n/m. (9.49) 
The last term gives the plasma frequency 


w2 = 4ne*n/m. (9,50) 


9.7 VARIATIONAL APPROACH 


The plasma state can be investigated using a variational method. It can be 
regarded as the state of free electrons modified by plasma modes. Thus, we 
may use as a trial function 


Ptrial = €XP |- y caonl | Free» (9.51) 


where px is the Fourier component of the density operator 
= eiK Ri 
PK » 


In Eq. (9.51) a, is the variational parameter. When we minimize the energy we 
find that Bohm and Pines’s treatment corresponds to the following choice of «x: 
2 wos 
oe a for K< K ritical 
ax = ho pK (9.52) 


0 for K > K critical 


Thus, the ground state for the plasma oscillation is 


2 2 
gy = exp |- > ie | PF ree (9.53) 


|kI<Ker AO pK 7 


Variational calculations done on functions of the above type* give correlation 
energies which were slightly larger for small r,, but coincide with Wigner’s 
values for r, 2 4. These correlation energy values are also larger than values 
obtained by modifications of the RPA theory for larger (and more realistic) 
densities. + : 


* T. Gaskell, Proc. Phys. Soc. (London), 72, 685 (1958). 
+ J. Hubbard, Proc. Roy. Soc. (London), A243, 336 (1957); P. Noziéres and D. Pines 
Phys. Rev. 111, 442 (1958). 
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9.8 CORRELATION ENERGY AND FEYNMAN DIAGRAMS 


We will be concerned next with finding the correlation energy of an electron gas. 
Although there is an easier way to do this (to be shown later) the present method, 
based on a paper by Brueckner and Gell-Mann* has the advantage of exempli- 
fying the Feynman-diagram procedure. The diagram method has proved very 
valuable and has wide application in quantum electrodynamics. Indeed it was 
for this latter field that the diagrams were invented (see Chapter 6, Section 12). 

The problem can be stated as follows: In a metal electron gas, the electrons 
interact via the Coulomb potential. The Hamiltonian is given by 


h? 2 
H - > V: ties. 2 (9.54) 
We wish to find the correlation energy AE due to the Coulomb interaction. 
Equation (9.54) can be converted into creation and annihilation operator 
language. Let a} be an electron-creation operator, that is, a} creates an electron 
of momentum P = AK with wave function e'*'®. Then ap annihilates an elec- 
tron, and as usual apapz + ap@p- = dpp. With the methods of Chapter 6, 
Section 8, Eq. (9.54) can now be written as 


p? 2ne*h? 
H = ¥ — apap + 
om oe V 


=H, + Wy, (9.55) 


where the @-dependence may be obtained formally by Fourier analyzing 
Eq. (9.54). However, Q is nothing more than the momentum transferred by a 
Coulomb interaction; 42e?/Q? is the amplitude of such a momentum transfer. 
Several simplifications are possible. Chief among these is the introduction of 
band b*, the annihilation and creation operators of holes. At T = 0, all states 
less than the Fermi level ¢g = P2/2m are filled. We then define b* and 6 as 
follows: 


1 
> 5 Q? 22? » 4P,-94P,+94r,4P, 
P2 


Ap = Ap if |P| > Po 

ap= bp if |P| < P. 

Cieiat 2 ? (9.56) 
ap = ap if [P| > Po 


at =b_p_ if |P| < Po. 


In other words, if an electron below the Fermi level jumps above the Fermi level 
we say that a hole and an electron have been created, and so on. We also have 


* Some references related to the correlation energy are: D. Pines, Solid State Physics 
I, 367 (1955). D. Bohm and D. Pines, Phys. Rev. 92, 609 (1953). M. Gell-Mann and 
K. A. Brueckner, Phys. Rev. 106, 2, 364 (1957). D. Pines and P. Noziéres, The Theory of 
Quantum Liquids, W. A. Benjamin, Inc., Menlo Park, California, 1966. 
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Py = hijuro, where V = N(4x/3)r3 defines ro (N is the number of electrons, V is 
volume of the gas) and « = (4/9z)'/. The momentum, P, will be expressed in 
Fermi momentum units (Py = 1) and the energy in rydbergs. 

From ordinary perturbation theory 


AE = <O|H,|0) aie Y (A onl 1)no He (Ay om A 1 mn FF no 
n#0 Ey —E, #0 m#0 (Ey — EmM(Eo — Ey) 
(9.57) 
Equation (9.57) may be written as 
AE “equals” (O\H,(0) + (O|H, _ | - Hyl0) 
Eo = Ho 
se 1 i 
+ <O|H; _- -H : H,|O>) +--:. 9.58 
COM, pg Hs gg, Hal (9.58) 


The quotation marks around “equals” in Eq. (9.58) are to indicate that the 
equation is valid only if a comment is added to it. This comment will turn out to 
be surprisingly simple when stated in Feynman-diagram language. We will come 
to this shortly. The notation of Eqs. (9.57) and (9.58) is familiar. That is, 
(O|H,|0) = J p*H io dx and 


(HD) mn = | ost, dx,etc., where Hoo, = E,Q,. (9.59) 


Before continuing, let us try to see the “sense” of Eq. (9.58). First, it is easy to 
demonstrate the (near) equivalence of Eqs. (9.57) and (9.58). By the ordinary 
rules of matrix multiplication, 


1 1 
OG. S22 ei eh oe 
Ol “Ey — He 110) ( : Ho 


0 


1 
= Y (Hom G os 1) ftom (9.60) 


But, excluding pathological cases, 
i ORS Ho)y dx = f(E,) | 0800p dx = Sguf(Ey). 


Hence 


esol Lay eee 
Eq — Ho) mn "" Ey — Ey 


and Eq. (9.60) equals 
> (Hyon no . 


a Ea >-E: 


9.8 Correlation energy and Feynman diagrams 257 


The comment to be added to Eq. (9.58) is that when we multiply out all matrices 
we exclude terms with n = 0. One could express this more precisely by writing 
(1 — |0><0})/(Ey — Ho) instead of 1/(Ey — Ho). 

Second, it is possible to understand more directly why Eq. (9.58) represents a 
perturbation expansion. This will not be demonstrated here except to say that 
the key to the demonstration is the matrix identity convergence* 

1 1 1 1,1,1 


: =—-—B-+- B-B--:::, (9.61) 
A+B A A A A AOA 


We return now to the mainstream of our problem. Using Eqs. (9.55) and 
(9.56), we can illustrate the possible fundamental processes allowed by A, 
by the set of diagrams in Fig. 9.4. A downward arrow represents a hole and an 
upward arrow an electron. The space below a dotted line stands for “before 
the interaction” and above, ‘‘after the interaction.” 

Some comments on Fig. 9.4 may prove helpful. Process (a) is the only one 
that can start from vacuum or the ground state. The interpretation of (a) is 
that an electron and hole are created, the difference in momentum Q is carried 
along the dotted line, and another electron-hole pair created, the momenta 
being such that the total momentum is conserved. In general, each diagram 
junction represents a two-particle interaction with conservation of momentum. 
As another example, (c) describes the annihilation of one electron and the 
creation of two electrons (of different momenta) and a hole. 

To demonstrate how Fig. 9.4 can be used to calculate the desired matrix 
elements in Eq. (9.58), we first focus on the second-order term, 


1 
(OH, 


0 0 


H,|0) = e®). 


The first operator, H,, operates on the ground-state wave function and thus the 
first operation must be (a) in Fig. 9.4. 

The next H, operation must get us back to the ground state, because 
{O[4|0) = Ounless Ago = ago. (The (Ey — Ho)~' operation does not change 
the state.) All the possible operations are given in Fig. 9.4, and it is quickly 


* Equation (9.61) can be proved as follows. Write 

Get ait 
A A 
Multiply this with A from the left: 


1 — BS, 


i 


ne en rae ee eee 
AO AA 


(A + B)S 


i 
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@) 


Fig. 9.5 The operations of figure 9.4a and b added together. 


seen that only (b) added onto (a) can return us to the ground state or to vacuum 
conditions. See Fig. 9.5. 

Before discussing the amplitudes or numbers associated with each diagram 
of Fig. 9.5, let us pause to look at some third and fourth-order diagrams-— 
Figs. 9.6 and 9.7. Figures 9.6(a) and (b) are both composed of the fundamental 
diagrams shown in Figs. 9.4(a), (b), and (e). 
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Fig. 9.6 Two third-order diagrams. 


Pitq 


Fig. 9.8 The first diagram of figure 9.5. 


When we reach fourth order a new ingredient appears. The diagram in 
Fig. 9.7 illustrates an evidently valid process. Note, however, that the diagram 
in this case consists of two independent parts—or unlinked clusters. It can be 
shown that unlinked clusters must be forgotten and not counted (because they 
involve intermediate states jn) = |0) in the expansion of Eq. (9.58)). In other 
words, the process illustrated in Fig. 9.7 is not valid and will not occur. The 
Statement that unlinked cluster diagrams must not be counted is the afore- 
mentioned comment that must be added to Eq. (9.58). With the addition of this 
Statement the quotation marks can be removed from that equation. 

Returning to the calculation of e'*? we ask, what is the number associated 
with each diagram of Fig. 9.5? First, all fundamental diagrams (Fig. 9.4) have 
amplitude 2me7h?/VO? where Q is the Coulomb interaction momentum 
(see Eq. (9.55)). Now consider the first diagram of Fig. 9.5, shown again in 
Fig. 9.8. g, P,, and P, are arbitrary but then all other quantities are fixed. 
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1 


(2) _ = 
ce?) = CO|H, E H H,|0> (9.62) 


) 


from Eq. (9.58). From Eq. (9.62), we see that the amplitude for a given g must be 


2ne*h?\? 1 1 1 
: : (9.62, 
‘vol | qq’ Ey — H, 


where |n> is the intermediate state in Fig. 9.8. Now 
ait Pig Pra Oo Pi 
2m 2m 2m 2m 


= @ + q°(P, — P,)). (9.64) 


Note that the energy associated with a hole is negative since we are measuring 
energy from the Fermi level. 1/(E,) — Ho) is called the propagator, and 
—mi[q? + (P; — P,)-q] is called the propagator factor. (2ne*h?/vol)° 
(1/q?)(1/q?) is the coupling. Thus for a given P,, P,, q the amplitude associated 


with Fig. 9.8 is 
272\2 e 
que Ge ) ae m (9.65) 
V qq?q? + (P, — P2)q 


To find the total amplitude A; (associated with the diagram of Fig. 9.8) we 
must sum over P,, P,, and q, thus: 


A re ‘ i —m d°q_ V dP, d} Py 
: ggg? +(P, - P,): q(2nh)3 (2nh)? (nh) 
(9.66) 


The integral is such that if R stands for the Fermi sphere, then P, is in X. 
P, in R, and P, + gand P, + q are outside R. Summing over electron spins 
gives us a factor of 4, two for each loop. 

Consider next the second diagram of Fig. 9.5, shown again as Fig. 9.9. 


Fig. 9.9 The second diagram of figure 9.5. 
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An examination of Fig. 9.9. enables us to write down the amplitude 
jmmediately: 


ee {| he cae ; ae =m 
z VV Ja\ Vi )(Pi +4 —- P2)? g? + (Pi — Pr) 


ys dog dP, d°P, 
(2h)? 


FSP)A(PIU — f(Pi + MIU — f(P2 - 9)). 


(9.67) 
Here 
f(P) = ic i |P| < Po 
0 if [P| > Po. 
The introduction of the last factor (involving the f(P)) enables us to dispense 
with the comments after Eq. (9.66) explaining the limits of integration. Now the 
integrations can go from — oo to oo. The sign of each amplitude is determined 
as follows. For Fermi-Dirac statistics (the present case), the sign is —1 for 
every closed ‘‘matter” loop. For Bose-Einstein statistics the sign is +1 for 
every closed matter loop. Figures 9.10(a) and (b) are each considered as one 
closed matter loop. Figure 9.11 is considered as two closed matter loops. 
Hence the sign of the first term in Fig. 9.5 is (+) and the sign of the second 
term is (—). To sum over spins for the second diagram of Fig. 9.5 gives another 
factor of 2 in Eq. (9.67). 


YO 


Fig. 9.10 Each of these is considered as one closed matter loop. 


one 


Fig. 9.11 Two closed matter loops. 
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9.9 HIGHER-ORDER PERTURBATION 


Consider the diagram of Fig. 9.12. It should be noticed that the interaction is 
always wave vector g. The perturbing energy contributed from this diagram 
is calculated (including the sum over spins) as 


(—1)323 2ne?h?\* d*q [d°P, (d°P, d*P3 4 
(2n)3 } (2x)? J (22)? J (2n)° 
welts “ye ee 8 7 joa 
(4q? + q° P,) + Gq? + 4° P2) Ga? + 4° P;) + (q? + 9° Ps)’ 


(9.68) 


where the integration is over the appropriate range. 

Because of the existence of the powers of q in the denominator, diagrams of 
this type diverge when integrated over q, even with the restriction on the range 
of integration. However, the final quantity of interest is the sum of contributions 
from these diagrams, and the final result is expected not to diverge. In order to 
arrive at the final result without getting involved in the divergence, 1/q? is 
first modified into 1/(q? + ¢) using a small ¢, the integrations are carried out 
including ¢, and at the last stage « is brought to zero. 

The worst divergences comes from diagrams such as those of Fig. 9.13. 
These are called the sausage diagrams, and the q’s are the same for all inter- 
actions shown by dashed lines. 


Consider the integrals 


F(t) = Ee “i F(p)(1 — F(p + qe "2 +4a-p) (9.69) 
and 
4=il =) di ood, 6G, Gh Hee 


2 — 0 


x Fa(ty)F (ty) +++ Fa(t,)- (9.70) 


The simplest case of A, is 


eo io 6] 
4 | dt, | dt, ke 4+ toe” H11@7/2 + 4° pig |t2l(a7/2 +4 Pa) 
wD 


-2s 


A, 


ll 


d° Py d*p, 


(2n)? (2n)3 
_ fap: [ 4*p2 4 ee eee) 
(2x)° } (2n)° 
d*p, ( d*py _ 1 
(22)? J (20)? q? + 9° (Pi + Po) 


(9.71) 
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—Pi+q 


i ~«— — 


Fig. 9.12 Diagrams of this type diverge when integrated over q. 


on oar He ee 


Fig. 9.13 Sausage diagrams. 


where all integrals are over the appropriate ranges. 
Thus we see that the contribution from an n-loop diagram is 


3 242\n 
C, = lee eA ) A,(q(~ 1)”. (9.72) 


In order to simplify 4,(q), use the integral expression of the 6-function 


oldu 
o(t) = iut 
() | — 


—ca 


so that Eq. (9.70) becomes 


du 


A [a pated tel Oe SEE (tt) eo Fe) oe 


sil= Sle 


n 
T 


ie [| aver go]. (9.73) 
2n 
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Use Eq. (9.72) in Eq. (9.71) to write the sum from all sausage diagrams as 


2ne*h?\" 
Cx —1)" 
Y “Fleas 4 (AEF) 0 


a *, (# asey Ee i ergo] 


d-q {du 2ne* h? 
ale ae oe dte™F (t 9.7 
- (6 nl om | e 0]. (9.74) 


The integral over J dt--- has been worked out for small q and is 


= ie _(d*p  2Gq? + 4°p) Z 
Q,(u) = [ee F(t) = | Ey eearerer caver F(p)[1 — F(p + q)] 


~ 4n ( ~ “tan! ‘\. (9.75) 
q u 


For small q, 


4n q? 
Qu) » a (9.76) 


Thus Q,(u) is finite for small g. 
The final answer for the contributions from all the sausage diagrams is 


2 
e= 2 (1 — In 2) dar, 4. {ln R)R* dy _ 1 (9.77) 
n? J R? dy ps 
where 
R=1-—vytan'y, 
f “ (9.78) 


a = (4/9n)1/3, 


r, is the average spacing of electrons measured in units of the Bohr radius. It 
should be noticed that a In r, term appears in Eq. (9.76). The final result for the 
total energy is 

2.22 0.916 


r2 


E= + 0.0622 In r, — 0.096 + O(r,). (9.79) 


r 


s Ss 


Here, the last constant term — 0.096 is the exchange term calculated apart from 
the diagram sum. 

It should be remembered that the mathematics in this section summed 
only the sausage diagrams as shown in Fig. 9.13. Although it is true that each of 
these diagrams causes the worst divergence, the theory would not be complete 
if the rest of the diagrams like the one in Fig. 9.9 were neglected. Gell-Mann 
and Brueckner examined the contribution from these diagrams also and —0. 096 
is the result; as was expected, it is smaller (for small r,) than the contribution 
from the sausage diagrams, but it should not be neglected. 


CHAPTER 10 


SUPERCONDUCTIVITY 


40.1 EXPERIMENTAL RESULTS AND EARLY THEORY 


Just about any material can be brought into a superconducting state, in which 
there is no measurable electrical resistance, by cooling it below a certain critical 
temperature. That temperature depends on the material; it gets as high as 
~ 21°K for Nb,,Al,Ge. Although the vanishing of the resistance is perhaps 
the most spectacular effect in superconductors, we can perhaps find clues to the 
cause of superconductivity by examining other effects. One such effect is the 
discontinuity in the specific heat at the critical temperature. 

The specific heat less the aT? contribution of the lattice gives the specific 
heat of the electrons, C,. Comparison of normal elements with superconducting 
ones gives the two curves shown in Fig. 10.1. For the normal element C, 
varies as 7, because the thermal energy is AT and only those electrons within 
an energy range AT near the Fermi surface can be excited. Thus, the thermal 
energy varies as T?. 

Integrating the specific-heat curve shows that the superconductor is lower 
in energy than the normal element. 


Superconductor 
Normal element 


1 
1 
1 
4 
1 
| 
| 
| 
1 
1 
1 
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T, 


Fig. 10.1 Specific heat of electrons as a function of temperature for a superconductor 
and a normal element. 
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Meissner Effect. A changing magnetic field produces an electric field, so 
something with zero resistance cannot contain a changing magnetic field. If 4 
superconducting sphere is placed in a magnetic field, the field lines will be forceq 
away from the sphere. If a ring is cooled in a magnetic field until it is a super. 
conductor, the flux through the ring will remain after the external source of the 
field is turned off. Apparently the resistance is exactly zero, for experimentally 
the flux remains constant indefinitely (provided the ring is not allowed to 
warm up). 

More surprising is the Meissner effect. If a solid (simply connected) piece 
of superconducting material is placed in a magnetic field and then cooled below 
the critical temperature, the magnetic field is pushed out of the superconductor, 
Technically, some lines might be trapped in the object, because some parts 
reach the superconducting state before others. Furthermore, if the magnetic 
field is strong enough, it might not be pushed out at all. In such a case, the 
material does not become superconducting. Its resistance and specific heat are 
normal. Because of its magnetic domains, iron cannot be cooled into super- 
conductivity. 

The Gibbs function is defined asG = F + PV where P is the pressure and V 
is the volume. Here the pressure can be taken to be the energy per unit volume 
required to push the field out. From classical thermodynamics, the Gibbs 
function has the property that in a reversible change of phase at constant 
temperature and pressure, G does not change. So the critical field is characterized 
by 

ul, 


F supercond oh ae V= Gsupercond = Grormal = Formals (10.1) 


At the critical temperature, H,, = 0, and as the temperature decreases Fyo;mal 
increases, so that H,, increases. 


Fi 
4 


Normal 


Super 


Fig. 10.2. Variation of critical field with temperature. 
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London observed, on the Meissner effect, that if 7 = density of electrons, 
mass = m, charge = —e and electric field = E, then acceleration is given by 


—eE = mx, 
and the current density / is 
—nex = j. 


In a magnetic field of vector potential A (if we use cgs units), 


gives a rate of change of current density of 
ee 
dt ot 
Hence, 
—M(A — A const) = ds 


where A = ne?/mc = constant. A,,, is constant in time, but may vary with 
position. It is fixed by the Meissner effect, so that no arbitrary magnetic fields 
can be put into the superconducting region. 

London proposed that A,ong be taken equal to zero for superconductors. 
We then must satisfy the boundary condition that j,4;mai = 0 at the surface of 
the superconductor, so that A, mai = 0. We do this by an appropriate choice 
of gauge, called “‘transverse gauge.” 

We conclude then that 

j= —AA. (10.2) 


Equation (10.2) implies a modification of the statement that there is no 
magnetic field in a superconductor. Since 
4n. 
Vasa FSG. (10.3) 
c c 
there is no sudden drop in the magnetic field to zero when we enter the super- 
conductor. In one dimension, for example, 


A o exp (+ V4nAJe x), 


where the sign in the exponential is chosen so that A decreases as the distance 
into the solid increases. The magnetic field penetrates to a depth of order 


700 A (J c/4A). The existence of a finite penetration depth can be determined 
experimentally by measuring the diamagnetic susceptibility of small drops of 
superconducting material, or by working with thin films. 

The above theory suggests where the A might come from and approximately 
how big it should be. But remember that electrons in a metal are not free, so A 
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is not exactly ne?/mc. The theory can be made to correspond with reality better 
by taking j = —AA’, where 4’ is A averaged over the position using an appro. 
priate function. For example, in one dimension the average might be taken as 
follows: 


A’ = normalization constant x [aces dy. 
For “‘hard”’ or ‘‘type II” superconductors, & is much smaller than the penetration 
depth. For “‘soft” or “type I’ superconductors we have large €. Impurities make 
a superconductor hard. 


A further contribution by London came from consideration of the quantum- 
mechanical electric current, which is —e times the probability current: 


= =" yevy — (WW)*Y). 
2im 


In a magnetic field the momentum operator becomes p + e/cA, so that the 
current is then 


j= ni (Fy $ “\y + (; v4 4) uy) v\ 
2m i 
—4 2 
= 5% tyrwy - wry] - <4 vy. (10.4) 
im mc 
With A = 0 we get 
J = 5° WS — (Vo)*V0] = 0. 
im 

When A ¥ 0, Wo changes into w. If wy = Wo, then Eq. (10.4) gives 


2 
j= —-— wy = --— A= -AA. (10.2) 


We see then that if the wave function is “rigid” (that is, if it does not change 
when A is introduced) then Eq. (10.4) implies Eq. (10.2). What could cause this 
rigidity? 

From perturbation theory, we have 


Y= vo + ye In 


where Ey is the ground-state energy and E&, is the energy of an excited state. If 
there is a gap between the ground-state energy and the energy of the first excited 
state, then E, — E, is large and wy = Wo. 
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Fig. 10.3. Variation of energy gap A with temperature. 


The assumption of an energy gap can also explain the anomaly in the specific 
heat. Instead of the energy varying at T?, a gap would cause it to vary as 
e~4/kT where A is the size of the gap. More direct confirmation of the existence 
of a gap has been afforded by experiments involving microwaves. The energy 
needed to excite the material across the gap could even be measured as a function 
of temperature, and A decreases as the temperature increases. 

It took almost fifty years for the problem of superconductivity to be 
reduced to that of explaining the gap. In what follows we will explain the gap 
following the theory of Bardeen, Cooper, and Schreiffer. This theory is essenti- 
ally correct, but I believe it needs to be made more obviously correct. As it 
stands now there are a few seemingly loose ends to be cleared up. 


10.2 SETTING UP THE HAMILTONIAN 


The energy gap is A ~ kT, ~ 10~* eV/electron, which is a small quantitative 
effect to produce a large qualitative effect. Thus the Coulomb correlation energy 
is too big, and can therefore be neglected. 

If this reason for neglecting Coulomb effects seems odd, remember that we 
are not trying to explain and predict everything about the solid. We are just 
trying to understand superconductivity. If some effect is associated with an 
energy that is too large, then we know that effect is not the cause of super- 
conductivity. Similarly, as we know that the energy associated with super- 
Conductivity is small, we can predict that certain phenomena (such as e* e7 
annihilation) are not going to be affected much by superconductivity. 

If we change the isotope out of which the superconductor is made, the 
Critical temperature changes. Spin-spin and spin-orbit interactions do not 
change with changes in the isotope, so they should be neglected. 
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The speed of sound is definitely dependent on the mass of the atoms of the 
superconductor. It is plausible then to guess that superconductivity has some. 
thing to do with interactions between the electrons and the phonons. Let ys 
try, then, a Hamiltonian of the form 

H = Vi exagay + Y oxCg Cy + » Mxx-Cx-K4K: Ax 
K K K,K’ 
+ ba Matx'Cg— xx Ax: (10.5) 
K,K’ 


ax creates an electron, Cg creates a phonon. é, is the energy of an independent 
electron of momentum K, and w x (f = 1) is the energy of an independen: 
phonon. We measure the energy of the electrons from the Fermi surface, and 
holes are treated as electrons with negative ¢. The last two terms of Eq. (10.5) 
represent the interaction between phonons and electrons. Usually this inter- 
action is taken to be responsible for resistance. High resistance at normal 
temperatures implies high M, which in turn implies a special propensity towards 
superconductivity. 

There are many effects that cause large energy changes, but which are easily 
understood and have nothing to do with superconductivity. For example, the 
effect represented by Fig. 10.4 changes the energy of the electrons by an amount 
larger than the gap, but does not produce a gap. 

The diagram in Fig. 10.5 just corrects the properties of the phonon. 


Fig. 10.4 An effect that causes large energy changes. 


Fig. 10.5 An effect that corrects the properties of the phonon. 
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Fig. 10.6 A two-electron process. 
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K, 
(ex,) 


K, 
(ex,) 


Fig. 10.7 One electron distorts the lattice, the other is affected by the distortion. 


We see, finally, that the superconductivity must involve more than only 
one electron or one phonon. We must consider diagrams such as that in Fig. 10.6. 
Physically, this diagram can be interpreted as a consequence of one electron 
distorting the lattice, which in turn affects another electron. Figure 10.7 shows 
that two electrons K, and K, interact via a phonon, causing K, and Ky; to 
come out. For the momenta the relations are 


Ki — ky 
K, — K; 
The initial energy is Einiiaa = &x, + &x, 
Virtual energy is 
E 


virtual 


in 


= Q, 


QO. 


the intermediate region, and the 


= &q, + &, + h@g. 


The perturbation energy due to the mechanism of Fig. 10.7 is 


I 


V, 7 


Mages 5 
Evnitial — 


= Mak; 


virtual 


1 


(x, + €x,) — (x, + &q, + hag) 


* 


Kik, 


Meg: (10.6) 
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Ky 


Fig. 10.8 Another two-electron process, of similar type to Fig. 10.7. 


Figure 10.8 shows another mechanism of a similar kind. The perturbation 
energy coming from this mechanism is 


1 
(éx, + &,) — (&x, + &x, + h@g) 


V, = Mux; Mix (10.7) 
For the calculation of the perturbation energy, the initial state and the final 
state have the same energy: 


&x, + &, = &x, + x; or &x, — x, = —(Ex, — &x;)- 


The perturbation energy is the sum Vx,«5-x,«,, We are mainly concerned 
with electrons near the Fermi surface, so we can take all ¢’s to be approximately 
equal. Then 


Varina, © ~~ (MaasMiix, + MauxiMiia)- (108) 
Q 
If the M’s are approximately equal, the perturbation energy is negative; so the 
electrons near the Fermi surface attract each other. 

In metals, the Fermi surface is curved. If we disregard the curvature of the 
surface, the Hamiltonian can be treated exactly. However, when we do that we 
find no superconductivity. Therefore the curvature of the Fermi surface 1s 
essential for superconductivity. 

The Coulomb interaction among electrons gives the interaction energy: 


4ne? 


: (10.9) 
|K, — Kj|? + (const)? 


Veoulomb = 


which is always positive, so the interaction is repulsive. The “(const)?” is 3 
consequence of shielding. In order to achieve overall attractive interaction 
between electrons we need 


V + Veoulomb < 0. 


10.3 A helpful theorem 273 


K; <— —> K, 


kK, ———_———> <+_———_ Kn 


Fig. 10.9 A case where |K, — Ky] is large. 


Looking at Eqs. (10.8) and (10.9) we see that for overall attraction when 
&x, © &;, larger values of |K, — K;| are more favorable. For example, look at 
the case shown in Fig. 10.9. 

We can summarize these results by writing a new Hamiltonian for the 
electrons alone. The phonons have the effect of modifying the ¢x for the electrons 
and of modifying the interaction between the electrons so that the interaction is, 
under certain circumstances, attractive. We will be working with the 
Hamiltonian 


= a5 + + 
H=Vecanag + Ye Vx w14K,9K; 4K, OK: (10.10) 
K KiK%;K,K2 


Our problem will be to discover how Eq. (10.10) can lead to a ground state with 
especially low energy. 


10.3 A HELPFUL THEOREM 


Consider a Hamiltonian 
A = Ao + U. 


The eigenvalues and the eigenfunctions of Ho are written as E; and @;. 
U;; = (9; Ug;). 


If all the £;’s are nearly equal, and also all the U;,’s are nearly equal, then a 
large amount of lowering of energy can be achieved. This is shown as follows: 
Try a wave function 


y= 2 AQ; 
and evaluate the energy expectation value for this state: 
é=) Ejal* + ¥ U,;a7a,. 
i ij 
Following the assumption let us put 
E; = Eo, U,, = —V. 


The normalization of the a,’s is 


> |a,|? = 1. 
Then 
€ = Ey - VY afa,. (10.11) 
i 
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Suppose there are m states of the required nature; what is the best choice of 
the a,’s that makes € of Eq. (10.11) a minimum? It is* for 


= 1)/m, 
when Eq. (10.11) becomes 
€= Ey — mV. 


Thus, when V is positive, or U,, is negative, the stabilization is m-fold intensified, 


10.4 GROUND STATE OF A SUPERCONDUCTOR 


With the theorem of the previous section as a guide, let us proceed to find a set 
of such y,’s for which the U,,’s are nearly equal and negative, and the £;'s 
are all nearly equal. 

In the k-space, a wave function is defined by the configuration of occupancy 


and vacancy of states. For U,,, consider 


(Kegs Kegs oo os Kis G|V as Kgs ss Kis ea), 
where 
+ a+ 
Vi = Vi nettesiteites eos Me, Ue 


We can consider this sum to be restricted to, say, ki > ky and ky > ky, 
Remember that a, ,4,, = —4,4,4,, and aj;a,; = —ajja;,. The two states con- 
sidered differ only in ky, k; and k,, k,. This matrix element is 


(Kay Kgs oes Ry ol hear Kegs «es Kis Ha) = EV eserstinge (10.12) 


The + sign comes from the ordering of k’s, and the rule is easy to obtain from 
Fermi statistics. The sign is (—1)%**’ where N = number of states between k, 
and k, in the function describing the initial configuration, and N’ = number of 
states between k, and k, in the function describing the final configuration. 
For example, 


(kas b> Ks, KAIV\Ry, k,, Kp, ky) = (= 0 aid Serer er ae 


Bardeen, Cooper and Schrieffer proceed to show that if a particular set of 
configurations is chosen, we can make Vj; 4; 4,4, in Eq. (10.12) all real and nega- 
tive, and the sign in front of it always positive. The idea is to take the states in 
k-space always in pairs. This means that the pair of states k, and k are either 


* To prove this, use the Schwarz inequality: 
ta = |Z1-a) < EPL lee =m 
i i j 


with equality if (a,,a@),...) o (1,1---), that is, if the a,’s are equal (and may be 
chosen real.) 
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Fig. 10.10 k-states arranged with pairing states side-by-side. 


both occupied or both empty, and the configuration in which one of them is 
occupied and the other empty is not allowed. When we make this requirement 
on the configurations we can show that the sign in Eq. (10.12) is always positive. 
Suppose all the k-states are arranged in such an order that the pairing states sit 
side by side as shown in Fig. 10.10. Using the rule (—1)%*%’ and the fact that 
N = N’ = 0, we see that the sign is always positive. 

Because Vy; 4,4, Should not vanish, we require for every pair 


ki + ky =k, + ky, = p = const. 
This constant p can be chosen as any vector, but for the ground state we choose 
p= 0. 
Thus, the pairs we consider are 
k, = —k,. 


How about spins? In computing the potential, we included the diagram in 
Fig. 10.12a but neglected Fig. 10.12b, plus and minus. The sign is + for states 
that are symmetric in space, that is, for states that are antisymmetric in spin. 
To make the potential as large and negative as possible, we take the states with 
antisymmetric total spin, that is with total spin equal to zero. We choose 


Oo, = —G,. 


3s 


/ 
Xx 


Fig. 10.11 Pairs in the k-space. 
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(a) (b) 


Fig. 10.12 Included and excluded states in the potential calculation. 


To simplify the notation, the pair (kT; —kl) is denoted by ‘k” in this 
section. Consider all the pairs in the k-space and arrange them in a certain order 


ky, kp, k;, one 


A pair k; is either occupied or unoccupied. 
Let us define: 


|~,(0)) = the wave function for kf and —k{| both unoccupied, and |g,(1)) = 
the wave function for Af and —k| both occupied. 


Then a wave function that is a possible candidate for the ground state is 


> = TT Wvx.. (10.13) 
where 


Wi» = U,leA1)> + Vi1@,(0)> 
and 
[Ugl? + Yl? = 1. 


The actual ground state is some linear combination of states of the form given 
in Eq. (10.13), but we will make the simplifying assumption that with appropriate 
choice of U, and V,, Eq. (10.13) is the ground state. By adjusting the phase 
of [g,(1)> and |g,(0)>, we can take U, and V, real. Our program will first be 
to find the ground-state energy by varying U, and V, = V1 — Uz. Then we 
will find the energy of the excited states, which will turn out to be a finite, 
macroscopic amount above the ground state. 


The energy of a candidate for the ground state is 
E= Cw H ly) = » CWlepan Ix + 6,447 4a_,lW) + a View SWlag a> yaya — |) 
(10.14) 
where all sums are taken over half the k’s and where V4. = Vir,—yecy,-4 Then 


E= » &KWal a Oy + a=,a_4\We> + » Vien Wiel Wil aie Oe Oy al Wa > Weed 
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Let 
Sp = Walag ay + a7 a_lWa> 

and (10.15) 
i = CWalag a7 glW,>- 


Then 
E = Y e455 + ¥ Vouicte: (10.16) 
k kk 
If 
lWy> 7 U,\e,(1)> + Vil@,(0)), 
then 


5, = 2U3, ty = U,V, = pee 


i 2 
E = ¥) 2eUG + 3 Veena Vr Uave (10.17) 


The U,’s for the unperturbed state are 
U, = 1 for k < Kgecmi- 
U, = 0 for Kk > Kgermi- 


The best choice of the ground state wave function is obtained by minimizing 
Eq. (10.17) with respect to the U,’s, V, being a function of U,. In the minimiza- 
tion it is not necessary to fix the total number of electrons explicitly. By varying 
U,, and V,, we vary the number of electrons. If we choose an appropriate zero 
of the energy €,, we wind up with the correct number density of electrons after 
minimizing the total energy. We will take the zero of the energy to be at the 
Fermi surface, so that Een; = 9. 


10.55 GROUND STATE OF SUPERCONDUCTOR (CONTINUED) 


Following our program we must next minimize Eq. (10.17) with respect to the 
U,?s. A set of equations results: 


4¢,U, + 2 y Vie UyeV ye: (¥ — a = 0. (10.18) 
k 
Because V,,, is real and negative, we can define 
A, = — > ViarUy Vie > 9 (10.19) 
and we write Eq. (10.18) as 
7 ee ea ie (10.20) 
V1 - U3 


Introduce x as 
uz =40 4+ x) (10.21) 
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so that 
1—-Uz,=10-»x 
and (10.22) 
1 — 2Uz7 = —x. 
Square Eq. (10.20) and use Egs. (10.21) and (10.22) to solve for x?: 
2 
2 & 
x=, 10.2 
E2 (10.23) 
where we define 
E, = Vez + Az. (10.24) 


Taking the right sign we obtain from Eq. (10.23): 
—(1 — 2U2) =x = —*# 
( ra) E, 


so that for e, < 0 (below Fermi level) x > 0 and U, > V,, as it should be. 
Putting this back into Eq. (10.21) we have 


(10.25) 


(10.26) 


U2 is the probability that the pair k is occupied. The form of U2 in 
Eq. (10.26) shows qualitatively that the distribution UZ is a rounded Fermi 
distribution. Assuming A, is small, we see 

Uj — | deep inside the Fermi sphere (e, < 0) 
and (10.27) 
Uj, — 0 far outside the Fermi sphere (g, > 0). 


These limits suggest the distribution shown in Fig. 10.13. 


— Unperturbed distribution 


Ux in Eq. 10.26 


€Fermi 


Fig. 10.13 The rounded Fermi distribution. 
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~~ 


Fig. 10.14 The range of nonzero V4, as assumed by Bardeen, Cooper, and Schrieffer. 


To complete the calculation we have to solve for A,. If we are to use Eq. 
(10.19), we shall find U,V,. Because Eq. (10.26) gives 


1 ey A 
UNS s ta ee ee 10.28 
Vi a E22, (10.28) 
Eq. (10.19) can be written as 
Kea Vip (10.29) 


Vane i 
Ror 2 e2, + Az, 
This is the equation to be solved for A,. 

In order to obtain some insight, Bardeen, Cooper, and Schrieffer assumed 
V4 is a constant for k and k’ within the range iiw,, above and below the Fermi 
surface, and zero otherwise. See Fig. 10.14. When we make this assumption, 
A, of Eq. (10.28) becomes a constant independent of k, and the equation for it is 


1 
w/e? + A? 
Changing >’, into an integral over ¢, we have 
V| (her M(e) de Moor de 
- BOLE iviMe) | aie, 
2| J -noor Ve? + A? o ve? + A? 


M(e) is the density of states for energy ¢; it is close to a constant M(0) near the 
Fermi surface. Solving for A, we obtain 


1=—-V 


Hey 


10.6 EXCITATIONS 


Now we are ready to find the excitation energy. To describe excited states we 
will have to consider the possibility of a given pair of momenta being half- 
occupied. Define 


@l2) = 4790), (3) = a7, 9(0). (10.31) 
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Then |y,,> is some linear combination of the four following orthonormal states: 


[W.(0)) = U,e,(1) + Vig, (0) = the appropriate |, for the ground state; 
lw(1)> = V@,(1) — U,,{0) = |W, for the excited state of a pair; 
ly,(2)> = @,(2) = |W, for a single excitation, with —k| unoccupied; 
iW.(3)> = @,(3) = |W, for a single excitation with AT unoccupied. 

(10.32) 


If Wy, = @,,(2) and y, = U,p,(1) + V,.@,(0) for k # k,, then the energy that 
we will call E’, is given by Eq. (10.16). Take Ey as the ground-state energy. 
Then s,, = l and 4, = 0, so 


E" — Ey = &, — 2e,, Uk, = » Vie (Ua ViU Vics) » ViVi, Ui Vi) 


= & (1 = 2Uz,) = 2(5 Vaal) Ui Vay 


We have used the fact that Viz, = Vitx = Vigx Using Eqs. (10.19), (10.25), 
and (10.28) we see that 


é A ez + A? 
E'-E, = “ki \ _ 2(-A,) (= |) = “He = 10.33 
ere ( 2 (—A,,) (st) z ve (10.33) 


1 


If Wa, = x,(3), 54, = 1 and %, = 0, so Eq. (10.33) still holds. If wy, = 
Vu Pui(1) — Ux, %,(0), the energy becomes E”. It is easy to show that s,, = 
Vii» te, = —Ug,Vy,, and 


E" — Ey = 2E,,. (10.34) 


Note that E,, > A,,. 
Suppose W,, = @,,(2) and Wy, = @,,(2). Then, if we are careful not to 
count V,,,, terms too often, we get 


E — Eo = Ex, + Ex, + Vena Vie UaVieg © Ex, + Eta, 
because Vi.4,U4,Vi,Un,Vq, 8 an infinitesimal quantity. Thus, 
Ewo excitations — Eo = Ex, a Ex, (10.35) 


If Wi, = Ve, n,1) — Ux,@x,(0), it is plausible to consider the excitation to be 
a double excitation, with both k, and —k, excited. 

We conclude that there is a gap between the ground state and the excited 
states, and that excitations consist of the breaking of pairs. The excitation 
energy is the sum of the E, for each excited electron in each pair. 
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10.7 FINITE TEMPERATURES 


To find the energy of a system at a finite temperature, we will use: 


Expectation value of the energy = E 


= ¥ (Probability of state i) x (energy of state i). 


Describe a state by the product [T, |W,,(7;)), or by the sequence n,, 72,..., 
where n; = 0, 1, 2, or 3. 

If P,(n) is the probability that |W,)> = |y,{n)>, then the probability of a 
state is []; P,.,(”;). Of course, 3), P,() = 1. Then 


E 


xX Tl P, (nj) b £54 (0) + py Paani) 


minz,... ff 


» ( » I] Pan) oo Py (n€x,Sk,() 


not nm, 


im \ me... 
not 1y,%m 


+ ae y ul Pind) YY Pal) Peg (Min) Vier tig MDE Keg (Hin)- 


But 
Y TL i, = TY Pen) = T] 1 = 1. 
j#l j#l ny 


cs j#l 
not m 


and similarly, 


Y IT Pa) = 1. 
m,... j#im 
not 11,"m 


Thus, 


B= YY Py men Slr) + DY PPA WVerknta( ten’). (10.36) 
lon I,m n,n’ 


Suppose we call f,, the probability that a k state is excited. Then the prob- 
ability that a given pair of & states is unexcited is 


P,(O) = (1 ~ fx)’. (10.37a) 
The probability that one of the states (say k) is excited, and the other is not, is 
P,{2) = PB) = fil — Si): (10.37b) 


The probability that both states of a pair are excited is 


P,(1) = fz. (10.37¢) 
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From Eggs. (10.37) and (10.15), 
S,(0) = 2Ui, 1,(0) = ULV, 
S,(1) re Win t,(1) os —U,V,, 
S22) = S,3) = 1, (2) = 4,3) = 0. 


Thus, from Eqs, (10.36) and (10.37), 
E= % 2e,[UKC — 2fp) + fl + 2 Via Uae Vir Ua Vall — 2f, ML — 2f,). 
(10.38) 


Here we assume that U2? + VZ = 1, but we do not assume that U, is such that 
Eis a minimum for zero temperature. U, and f, are functions of temperature. 

For a given set of orthonormal states U, and f,, define a set of P; = prob- 
ability of state 7. Then 


E= YEP; 


S 


~k¥ P, in P; (k = Boltzmann constant) 


F(P,) = E — TS. (10.39) 


If the P,; are proportional to e~*/*™, then F(P,;) = free energy. It is easy to 
prove that if F(P;) is minimized with respect to P, (subject to the condition that 
> P; = 1), then the P; have the correct values for temperature T. 

Using the method of Lagrange multipliers, we find that 


OF _ ,dxP, 
aP, 


i i 
Then P, = e@/kT~De~ Eilkr 

A must be chosen so that > P; = 1, in which case the P; are correctly 
given for equilibrium. 

For the case of superconductors at a finite temperature, minimizing F with 
respect to P; is equivalent to minimizing with respect to U, and f,. 

Before we do this, note that we are really considering an F that has an 
undetermined number of electrons. We then minimize subject to the condition 
that the expectation value of the number of particles is a given number. So, for 
example, P, should be written P(n,, n2,...) where ,, n2,... are the number of 
particles in state 1, 2,... respectively. In addition to the condition that 


YY Pan, ng,...) = 1 


Ny M2, es 


0= 


= E,+ kT (inP, + 1)— a2. 


we have 


YX (ny t+ ng + ny +°°+)P (my, n2,...) = fixed number. 


1,82, 05. 


10.7 Finite temperatures 283 


The method of Lagrange multipliers then gives what amounts to the chemical 
potential of Chapter 1, Section 6. When working with superconductivity, we 
do not explicitly write the chemical potential because it is taken care of by an 
appropriate choice of the zero of the energy. 


S = entropy = —k > probability of state i x In [probabilty of state i] 


i 


-—k¥ Y P,{n) In P,{n). 
It follows that 
TS = —2p7! »» [f,inf, + d -—f,) nl — fi] 


6F _6E _ 6TS 


Sf, fe fe 
= 2e,(1 — 2U2) — 4 Y Vie Vie VieU Vil — fe) + 2p7* In are 
k’ — Ik 
(10.40) 
Set 
& = &(1 — 2UfZ) — 2 D; Via Uy Vir Uy Vil — 2fe-) (10.41) 
‘ 
Then 


fa = 1f(eP* + 1). (10.42) 


If we now take 


Fo cone ae eae aha Enz + 2) VaeUeVe (Ta _ v) (1 = 1) qd _ 2h) 
(10.43) 


we have (in principle) equations that can be solved for both U, and f,. Now, 
define 


A, = — py Vi Ue Veh — 2fy)- (10.44) 


Then Eq. (10.43) becomes identical to Eq. (10.20). All our results from Eq. 

(10.20) through Eq. (10.28) are true at finite temperatures. Equation (10.41) 
becomes 

é, = é,(1 — 2U2) + 2U, Vi Ay = Ey, (#1) + 2 ae A, = Ey, (10.45) 

k 


where we have used Eq. (10.44), then Eqs. (10.25) and (10.28), and finally 
Eq. (10.24). 
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The final result is 


Ae Vy a tanh oe, 

. (10.46) 

Ez = e + Aj. 

To simplify this equation, assume 

View = -V <0 for |le,| < Aa, 
Then A, becomes independent of k, and Eq. (10.46) reduces to 
h@er 
i os Le oe (10.47) 
MOV |, ET’ 


where M(0) is the number of states per unit energy at the Fermi surface. The ‘2” 
in the denominator of Eq. (10.46) drops out because f°... de and [ de are 
equal. When T - 0, Eq. (10.47) gives 


her 
eink hs OOse = 2| de exp [=ve? + A%/kT] (10.48) 
M(O)V 0 ve? + A? 
Assuming M(0)V is small, we find 
fies - AO) Shae So: (10.48’) 


MOV AQ)’ 


It is easy to see that as T increases, A decreases. There cannot be a value for 
A at all temperatures, because in the limit of high temperatures we get an equa- 
tion independent of A: 


M(O)V 7 


Lt re sees Pe Boles 
9 « .2kT kT’ 


and this equation may simply not hold. 

In Eq. (10.47), A appears only as a square (in E = Ve? + A?). The effect 
of decreasing A is to decrease E(e). But E(e) cannot be lowered below the value 
it has for A = 0, The critical temperature, then, is at A = 0, above which there 
is no solution for A. Putting A = 0 in Eq. (10.47) we have 


her 
eee Oishi we In {1.14 2c), (10.49) 
MOV Jo « ae kT, 
Equating Eqs. (10.48’) and (10.49), we get, 
2A(0) = 3.52kT.. (10.50) 


We write “‘2A(0)” because in experiments in which a superconductor is excited, 
we create both a hole and an electron. Equation (10.50) is experimentally 
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A/kT, 


1.76 


(e) 1.0 pe ees 
Fig. 10.15 A(T) as a function of 7|T,. 


C. s/yT, 


2.52 


; T/T, 


Fig. 10.16 Specific heat of a superconductor, C,,s, as a function of 7/T,. 


fairly good. Let T = T, — t. As t increases from zero it is clear that if Eq. 
(10.47) is to remain true, E must change from ¢ by an amount proportional to t. 


But E = Ve? + A? we + A?/2e, so A? is proportional to t. That is, if T ~ T.., 
A? = A(T. — T) 

for some positive constant, A. We conclude that the general shape of A(T) is 

as in Fig. 10.15. 


Another check with experiment is the specific heat whose electronic part 
is of concern to us. The normal specific heat C,,, 18 proportional to T: 


Co, = IT; 


en 


Y being a constant. For the superconductor, C,,, is plotted in units of yT,. It is 
as shown in Fig. 10.16, which is in agreement with experiment. 


10.8 REAL TEST OF EXISTENCE OF PAIR STATES AND ENERGY GAP 


Any phenomenon in which scattering of electrons is involved will serve as a 
test for the existence of the pair states. Attenuation of phonons and para- 
Magnetic relaxation are examples. 
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k‘o’ 


ko 


Fig. 10.17 Electron scattering. 


In such phenomena the interaction part of the Hamiltonian includes thi 
form 
H' = Yo By elie Anes (10.51 
ka,k'a’ 
where o is the spin coordinate. Equation 10.51 represents the amplitude for a: 
electron with ko being scattered into k’o’, as shown in Fig. (10.17). When th: 
pair states proposed in the BCS (Bardeen, Cooper, Schrieffer) theory exist, ; 
scattering of an electron kf induces an interference with the paired electron a 
—kl. 
The coefficient B usually has a symmetry like that shown in Fig. 10.18 


There are two cases to be distinguished: 
Brg ho = Boge~o,-k'.-0' (Case I), 
oat eed (10.52 
Bua’ sko = Big pce eg! (Case I). 


In the ordinary scattering case we write a table: 


kt k’t 
Initial occupied empty 
final | empty occupied 


That is, we consider the scattering of an electron from kf to k’f. 

For the BCS case, the table should include kT and —k| together. When we 
speak of the scattering of a kf electron into k’t, we mean that we take the initial 
wave function with W, = @,(2) and Wy = Uyo,(1) + Vi-o,(0). Then final 


k,o 


Fig. 10.18 Symmetry of the coefficient B. 
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wave function is w,, = @,(2) and Wy, = U,9,(1) + V,@,(0). Then the matrix 
element for the transition is 


(W2| Al = (@u(2)1( Opp!) Fe Vic PlO) (By 41c¢ Uate 4 Meet 
zB B_gy— 4/474) 4-4'4) 
x en2)> Upon) + Vie) 
= Byny VV + Beuy ney Une: 
= Berns ViVar + UU). (10.53) 


The + sign gives the two alternative cases (1) and (II) in Eq. (10.52). Changing 
the sign from plus to minus causes a big difference in the scattering. Qualitatively 
‘we can understand the change when we consider the case k’ ~ k. Then we see 


Vi+U2=1; V2—U2=1—2U2 = 6/E, = &/vez + AZ. (10.54) 


When the sign is positive, the scattering is enhanced, whereas when it is negative 
the effect is reduced. 

Phenomena such as attentuation of sound, thermal conductivity, and 
paramagnetic loss have been examined and the theory can explain the experiments 
well. 

Let us now discuss some gap experiments. Suppose we form a junction by 
placing a superconductor with a thin insulating layer (such as an oxide of the 
superconducting material) against another metal, which can be either a normal 
metal or a superconductor. Ifa voltage is placed across the junction, the amount 
of current will be affected by the presence of the gap. Semiquantitatively, we 
can set up the problem as follows: 

Label the metals 1 and 2. Electrons in a state described by “‘k”’ in metal 1 
will have an amplitude to go to state ‘“‘g” of metal 2. Let that amplitude be a 
constant, M, in the energy range that interests us (near the Fermi levels). The 
probability of transition from state k of metal I to some state of metal 2 is 


2 
. Dy |M |? (probability of state k being occupied) 
q 


x (1 — probability of state g being occupied) x 6(E, — E,), 


where E, and E, are the energies of the respective states. The probability of 
State k being occupied is 
1 


FE) = exp [E,/kT] + 1° 


and similarly for state g. If voltage —V is applied to metal 2, the energy of 
State q becomes E; = E, — V, and the probability of g being occupied is the 
Same as before, namely /(E,) = f(Ej, + V). 
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The current from metal | to metal 2 is 


TL IMEFEDL — SE, + VY] Es ~ EY). 
If 
I = the total current = current from | to 2 — current from 2 to 1, 
we find that 


_ 2ne 


au IMP?{f(E)L1 — f(E, + V)) — f(E, + VLE — f(E)]} OE, — E 


a) 
2 a IMI? © Lf) ~ f(y + V1 SE — E 4). 


Now 'x,q can be replaced by J dE, dE{p,(E,)p2(E,), where p,(E,) and Pr E,) 
are the density of state functions in the respective metals. Then 


ur) = 7 Mp | dE[f(E) — f(E + Vy|p(E)px{E + V). 


At absolute zero, 
I 2ne 2 0 7 
(0) = IMP | dEp(E)px(E + V). (10.55) 
-V 


Thus the current depends on the density of states near the Fermi level, £,, 
for the two metals. For normal metals we can consider p(E) to be roughly 
constant, but for superconductors the interactions between electrons cause the 
spectrum to be distorted. The state that would have had energy «, without the 
interaction actually has energy |E,| = V 62 + Aj. For simplicity, take A, = A 
independent of k. Thus there is a gap around &, = 0. While dN/de (which 1s 
the density of states if there were no interactions) is roughly constant, 


dN|_ dN 1 _ dN |E 
dE, dey \dE,/de,| de JF? — x? 


p(E,) = 


rises to infinity as the gap is approached, then drops to zero within the gap. 

If metals 1 and 2 are normal, p,(E) and p,(E + V) are constants, and / is 
proportional to V. But if metal 1 is a superconductor, p,(E) is zero for all £ 
in the integral of Eq. (10.55), unless V > A. Thus for V < A there is no current. 
For V > A, it is easy to show with our simplifications that J oc af V2 hi 
and looks like the function in Fig. 10.19. At finite temperatures, the curve 
looks like the function in Fig. 10.20. 

This situation, with metal 1 a superconductor and metal 2 normal, can be 
visualized as shown in Fig. 10.21. At V = 0, E,, is the same as E,,. When V 
lowers E,;, more than A, below E,,, current can flow from | to 2. 
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0 ay 


Fig. 10.19 Current across a junction with one superconducting metal at zero temper- 
ature. 


Ay ua 


Fig. 10.20 Current in the same junction at finite temperatures. 


Unoccupied 


levels at 
Unoccupied 
Gap 4 levels 
a Er, 
Filled 4 
levels 


1 (superconductor) 2 (normal) 


Fig. 10.21 Levels in a junction of two metals, separated by an insulating layer. Metal 1 
is a superconductor and metal 2 is normal. 
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I I 
{ ry 
T=0 T>0 
to frre + 
oO 4, +4, O) |4,-4p| A, +4, 


Fig. 10.22 Current in a junction of the superconductors, at zero and finite temper- 
atures. 


Suppose both metals | and 2 are superconductors. The current as a function 
of voltage looks like the curves in Fig. 10.22. From the second curve in Fig. 10.22, 
both A, and A, can be obtained. For more details concerning the energy gap, 
see Progress in Low Temperature Physics, Vol. IV, page 97 and on (especially, 
page 140 on). 


10.9 SUPERCONDUCTOR WITH CURRENT 


In Section 10.4, we assumed that the electron pairs had zero total momentum, 
so thatk, + k, = 0. The more general situationisk, + k, = 2Q = constant 
for all pairs, in which case there exists a current. As before, we can describe a 
pair by a single momentum vector, k, but in this case, 


ki =k+Q, 
k, = —k + Q. 


Formerly, the energy of a pair was ¢, + €_, = 2&. But 


h? h? 
5 (k + OY + (-k+ OP = 25 422", 
2m 2m m 


so 
Ey, + &m, = 2e, + 2(h?Q?/2m). 


In analogy to what we did before, we define V\Q) as Vis9,-n+04°+0,-k +0" 
Notice, however, that V (2) = V,,- + terms of the order Q? or less. The change 
in V,,- of order Q? is similar to the doppler frequency shift in sound emitted 
from moving objects. Neglecting “‘Doppler shift’’ we modify Eq. (10.17) only 
by the replacement of 2e, by 2e, + 24?Q7/2m. The energy of the ground state 
is increased by (47Q?/2m) >, 2UZ. 
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The number density of electrons is W = | 2Uj, and we can define » = 
f#Q/m. Then 
excess energy = W(mv?/2). (10.56) 


r= (W> v), 


e 
=. P 
m 

where p is the momentum operator. It follows that 


The current is 


y=" ¥ [+ O) + (-k + QUE = -e “2 ¥ 2U7 = - yer. 
m k k 
(10.57) 


How is the wave function changed by replacing pairs +k with pairs 
+k + Q? The equations for U, can be brought up to date by replacing 
e, with ¢, + #?Q?/2m. As well as 4?Q?/2m, we must add another constant 
to make the zero of the energy appropriate for the correct number density of 
electrons. The net result is that the zero of energy is again at the Fermi surface, 
and the U, come out the same as before. But now the U, describe the amplitude 
for +k + @Q to be occupied. 

Suppose we describe the ground-state wave function by Wo(x,, X2,...). 
We want to modify this wave function to become Wo(x,, X2,.-.). This modifica- 
tion is effected by replacing each component of momentum p by a component of 
the same amplitude, but of momentum p + #Q. That is, ¢'?*’*«/" is multiplied 
by e!2'*e, Then 


Wo = exp (2 : y *.) Wo = exp : 7 ‘ » «| Wo: (10.58) 


Equations (10.56) and (10.58), it should be noted, represent simply the 
Consequences of a Galilean transformation of Schrédinger’s equation. According 
to our approximations, a superconductor with a current is the same as a super- 
Conductor in its ground state as seen by a moving observer. 

In a real metal the current can vary from place to place. Because k is 
large near the Fermi surface, the lowest-energy wave functions are not affected 
Much in energy if the variation in current is slow. 

To see how to deal with variation of current with respect to position, 
assume a metal is divided into regions. Each electron is constrained to move 
Within a given region, and for each region there is a different current. Then the 
Wave function becomes 


y exp ity, > X_. } &Xp eee re oy Xe “Wo 
h electrons h region 2 


in region 1 


exp (? Y ox.) - x.) Wo- (10.59) 
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We can now let o(x,) vary gradually with x,. But this is not quite right. To , 
why this modification of y is inaccurate, remember that the equation for curre, 
density involves derivatives. The fact that v(x,) is not constant modifies ty 
current. 

To improve our description of a current in a superconductor, let us ne: 
try 


w = exp ( py ax) Wo: 
The number density of electrons is given by 
N(R) = yf vr ace, — R) dx, dx,---, 
and the current density is given by 
FC | [WtVey — (WW) (x. — R) dey dy, 


If we remember that the current for Wy = wo is zero, we find 


(R) = - ve A vo. (10.60) 
m 


For example, if 9 = mv-x/h, j(R) is the same as that for Eq. (10.57). 

Note that we have only described currents for which V x j = 0. For the 
steady state, V-j = 0. 

In a ring, the above conditions can be satisfied with nonzero current. In 
this case, $j ds = —WNe(h/m) AO, where (A@) is the difference in 8 as we go 
around the ring. A@ need not be zero, but it must be a multiple of 27, for we 
require that the wave function be single valued. If we define v = —j/.4-e = 
(f/m)V@, then 


eds 7 ann”, (10.61) 
m 


where n is an integer. If the ring’s central hole shrinks to a point, the only wa} 
ncan be kept greater than zero is to have lines along which the superconductot 
breaks down. 

By calling » = —j/We, we have implied that this quantity can be thought of 
as a velocity. This interpretation is confirmed by computation of the expected 
value of the energy. 
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p= [ytHy = {fv [25 etry 


me | Yl Vb + [uve 
2m e 


a pa [ere [ow Vow) (x. — R) + [yve 
Me 


WV = WoViho. (10.62) 
Also, 
vy* “Vy = Ws ‘Vo + iV6- [WoVs = OVW] + (vo)? wo Wo. 


Equation (10.62) becomes (if Eo is the energy for y = yo): 


E 


Ey + x [ a°xivoce) : [z [WoVeWe — WeV.bo] (x. — R) 


ee | eexivo ¥ vibod(x, — R) 
2m e 


Ey + f «| eRvaCR) Jo(R) + Joy | RIVOCR N(R). (10.63) 
e 

But j,(R) = current for ground state = 0. Then 

E=E)+ [exam N(R), (10.64) 


as we would expect from the interpretation of v(R) as a velocity. 


10.10 CURRENT VERSUS FIELD 


In the equation j(r) = —AA’(r), A’ is averaged over some region about the 
point r. In other words, we must use a sort of ‘“‘spread-out” A(r). We can 
interpret this spreading as an effect of the size of the electron pairs, and we can 
calculate the spreading by means of the BCS theory. 

Empirically we have 


Ar’) = ee pele (r ome! iG = r’) d°r, (10.65) 


lr — r'|* 
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where C is an appropriate normalization factor (so that constant A give, 
A’ = A). Note that Eq. (10.65) has the form of a convolution. If we write 


ror [iment ae A(q) = | Arye" dr, 
(10.66) 


i —r/Eo 
K@) = | K(ye"®" Br, K(r) = CO 
r 


4 
then 


i(q) = A(q) - K(q). (10.67) 


In what follows we will find the component of j(q) that is in the direction 
of A(q). 

Suppose that A(r) is in the z direction and has magnitude Aye” '4°*, where g 
is perpendicular to A (we are using a gauge in which V- A = 0). Then 


Aq’) = [Aven d°x = Ay 9 q,0’> 
where everything is normalized within a unit volume. 

In Section 10.1, we showed that if the wave function is “rigid,” jy = —AA. 
In so far as the wave function is not rigid, we get a contribution from the 
paramagnetic current of 


i= = [yvy — (WWW) dota =F + Jo: 
im 


Set A = 1. By considering wy as an operator (as in second quantization) we could 
get the operator for j(r). But we will use a slightly different approach. 
For a single particle, the operator for j,{r) is 


ir) = —* [P, 6 — ry) + 6" — Py], 
2m 


where r, is the operator for position and r is a vector (a set of three numbers). 
Then 


i) | Cisye#t = | Wiliu@iyyenr ar 


ll 


= [fan a°ng*(r;) E V, or — 4) + or =n) + v. (re ia” 
2m i i 
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After some manipulation we find 
; =e iger igen 
Ai@) = — WlPye "9" + eT PW). 
2m 
Summing over all particles, we get 
j : ae ~igeri ~iqer 
iq) = » ila) er «| » [Pe +e @ "Pp. — (10.68a) 


By, say, >, e-“"°"'P,, we are representing the operator, acting on all the particles, 
that on one particle is e~'4°"P (r and P are operators), Using 


ear = | ateeraeyca 


and 
_ ( d°P 
(2z)° 


P|P)<P| 


it is easy to show that 
e '9'P =) kik — q)><kI. 
k 


In Chapter 6, Section 8, we learned how to find the corresponding operator on 
arbitrary numbers of particles. The result is 


ye arp, = Y ka*(k — g)a(k). 
I k 


Similarly, 
Lhe = Yt — q)a*(k — q)atk). 


Equation (10.68a) becomes 
iq) = — WY 2k — q)ag gar). (10.68b) 
2m k 
Because q is perpendicular to the z axis (i.e., to A), 
ja) = —* CY, ka gael. (10.69) 


The wave function ly can be easily found by means of perturbation theory. 
The Hamiltonian for the superconductor with no field has a term 
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This is modified to become 


ue eA(x,)\? 
yy G + = ) ‘ 


2m 7 


Using the fact that V- A = 0 and neglecting terms of second order in 4, we 
see that the Hamiltonian is modified by 


e e igi eA 
= — > A(x,) & P, = -—— x Age? Pr = = en0. » k Ais gy. (10.70) 
me T mc 1 mc 
The wave function is 


I> = lod + » AnlWn> 


where 


, 7 + 
ee (n|H'|0> _ —eAo <n| De keag+g@ul0) (10.71) 
Ey — E, mc E,, — Eo 


Here we are using |”) and |w,,> interchangeably. | > is the ground-state wave 


function, described by Sections 10.4 and 10.5. Putting together Eqs. (10.71) 
and (10.69), we obtain 


5 —_e 
fg)» = «a Y; kag qaul0) +E A2Gn1 Y kee aul) 
+E ACO kadai) | 


We will calculate the third terms of the above result, and it should then be easy 
to see that the first two give zero. 


n 
C01, Keak gauiny = Cal (Z kata) 10> 
From 
re 
(x kaka) = Di k.ag deg =i k,ag.qa, (because k, + gq, = k.) 
k K 
and setting A + q = k’, we find 


iq) = Soy el re aa (10.72) 
™~ 490 


The ground-state wave function is a product of terms such as 


W(0) = U, kT, —kl> + V,10, 0>. 
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k,4u1%, Operating on W,(0)Y,-(0) gives k,U,V,.|0, —kL>|k’'T, 0). In other 
words, k,a,,,a,, takes the state 
kf, —kl kt, —k'\ 
occupied empty 
into the state 
(—k), (k’T) kt, —k') 
occupied empty 
with amplitude k,U,V,,.. We can get a similar final state by using the operator 
—k,a*,,@_,-,. This operator takes 
kt, —kl kt, —-k'\ 
empty occupied 
into the same state as that found by using k,a,',a,,, but with amplitude 
-—k,UyV,. Then 
e7Ay y |<k and k’ excited|(k,a,,4,, — k,a*,a~4-)10)|? 


iA) = wee 7 E, + Ey 
2 2 & ag 
— © Ao y UV = VU)” (10.73) 
mc ‘k Ex + Ex 


The algebra in the above derivation is not rigorous. For example, the signs 
have not been unequivocally proved. The reader should do the algebra himself 
to check such things as the correctness of the signs. It is important to be con- 
sistent ; decide, for example, whether |kT, —k1) is a,4a*,,|0,0) or a>,,4,4{0, 0). 
The following steps are straightforward: 


(UV — Vale)? = URViEn + VUE — 2UV UV ee. 
From Eqs. (10.26) and (10.28), the right-hand side becomes 


1 ies Ay An Exe: 
2 Eko f° 
and Eq. (10.73) becomes (because j,(q) = K,,(q)4.(q)) 


Kew) = e? y k? 1 — AnAw teen: 
a. 2m7c ‘KE, + Ey: E,Ey 


with Ex = V2 + A? (Eq. (10.24). 

Bardeen, Cooper, and Schrieffer find what amounts to K(q) for arbitrary 
Components and nonzero temperature. They also show that at zero temperature, 
the result approximates the empirical formula Eq. (10.65), with & = 
0.15 to 0.27 in place of the factor 0.18. Considering the approximations we 
(and Bardeen, Cooper, and Schrieffer) made, the theoretical results are very 
good. 


298 Superconductivity 


10.11 CURRENT AT A FINITE TEMPERATURE 


To compute the energy of a superconductor with a current and at a nonzero 
temperature, we must modify Eqs. (10.36) and (10.37) as shown in Table 10). 


Table 10.1 


What was formerly 


Becomes 


&5,(0) = 2¢,U2 
ey5y(1) = 26, V2 
&5,(2) = & 
&5;(3) = & 


P,O) = Ud — fd - fd) 


Py) = SR 
P,(2) = Ay sy Si) 


P,G) = A — fi) 


= (1 — f,)’ 


(Cxs9 + €-ni gQUi 
(Gx4Q + & nigh V Zz 
&K+@ 


€_K+Q 


ad a Sir ov = f-x+9) = Probability 
that neither (A + Q) nor (—& + Q)are 
excited. 


Iu+of-n+9 


Sx+Q(l — f-nsg) = Probability that 
(&k + Q) is occupied (excited) and 
(-—k + Q) is not. 


f-nsgl — fir+g) 


The energy is 


E= 2 (it — Sas Ql — fens ore + 8+ Qi 


+ Lfarof-n+ol%e+0 + &_nighVi 


+ dX Sasol = S-ns [e+ + Lf-n+0 =F Ses Qt-K+0 


+ VY ViPUY UVLO — fara — f-nso) — frsof-nia)] 


k,k’ 


x [1 = feral — fw +0) — furol-nsoal 


= VY Ulaso + e-n+gQUil — Baro) + erohrol 


k 


+ YY VIPU MU Val — feral — 2fer+o)- (10.74) 


k,k’ 


Here we have used U, = U_,, V, = V_, and Vige = Vite = Vu~qe- 
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So far, we know neither what f, is nor what U, is. As before, we must 
minimize F = E — TS, where 


TS = — p" > » P,(n) log P,(n) 
— Bp » Lfn+o log fargo + f-n+g log f—n+g 


+0 - Sura) log 1 ~ Sao) + (0 - f-n+9) log (1 — f-nig)] 
~— 2p7* py Lfe+o log fit+o + - Siro) log 1 — Sut]: (10.75) 


If we vary with respect to U,, and let 


A, =- » ViPU eV — 2fx+@)> (10.76) 


then we get an equation similar to Eq. (10.20). In fact, if we neglect errors of 
order Q”, we get Eq. (10.20) exactly, and everything from Eq. (10.20) to Eq. 
(10.28) is again true. More generally, we can define e(k) = 4(é%19 + & x49); 
and Egs. (10.20) to (10.28) hold with e(k) in place of &,. 

Next, vary with respect to f,,9. The result is 


0 = —2e(k)U2 + frg + 2A,UW, + Bo! log Set 
— Jk+Q 
= Eig — &k) + Ex + B~' log — Serg 
I= Sito 
2p. 
= FEO. FL + Blog Seto, (10.77) 
m Tse Si+o 


from which we find 
peas 1 while 
exp {[E, + #(k- Q/m)]B} + 1° 


Equation (10.78) is what we would expect from a Galilean transformation 
of velocity v = AQ/m. 

We might also expect that the current should be the velocity times the total 
density. In fact, this is not the case. The actual current is less than this, for the 
energy of excitations causes them to prefer to move in such a way as to oppose 
the flow. 

The effect described above should not be confused with ‘“‘back flow,” 
which is instead an effect that comes about as follows. Consider the problem 
of a wave packet of excited electrons in a superconductor. Such a wave packet 
can be described by an approximate wave function, which, in turn, can be used 
to find energies and localized currents. But it turns out that the approximate 
wave function first used did not conserve current. An improved trial wave 


Suro = (10.78) 
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function (which does not improve the energy much) does conserve current, 
“Back flow” is the additional current that arises from the correction to the wave 
function. 

Now, let us see how much current we get from the wave function we have 
chosen for a uniform current in a superconductor. 


Electric current x m(—e) = momentum 


= » Probability of state i x momentum of i 


= = 3 P,(n) x contribution of the + k + @Q states to the expectation 
value of the momentum 


= LO ~ feral ~ Surg Uae + Q) + (—k + OI 
+ % fas o(! — f-xsgk + Q) + LS nol — fxso(-—k + Q) 
+ Dharof-n+oVilk + Q) + (—k + 9) 
=@ » [2UKC. — 2fn+9) + esol + XM favo — f-nsg)- (10.79) 


The term multiplying Q can be written as 


QO > 2Uj +0 » gd - QU fk+0) = NQ + 20 Pho 


(see Eq. (10.25)). 


Dia Ses g&(k)/E, is roughly zero. To see why, consider f, 49 and e(k)/E, as 
functions of energy, e. (See Fig. 10.23.) f,4g is roughly symmetric about the 
Fermi energy, and e(k)/E, is roughly antisymmetric. Their product is roughly 
antisymmetric; so it integrates out to zero. 


Sta 


e(k)/E, 


—e Energy 


\ 


Fermi energy 


Fig. 10.23. The functions f,42 and é(k)/Ex. 
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The term in Q is therefore, .”Q. The other term in the current is 


d*k k 
Hive Sone) = Fhe = 2 Os ay (Bp 4 Bll FT 


Up to first order in Q, the integrand of this term is 
7 7? en ~  &  _ kk Q/m)Be*** 
exp ((E, + (k-Q/m)|B} +1 ec 41 (eh + 1)? 


The only contribution after integration comes from the second term in the 
integrand, which by symmetry gives 


9 d*k kk: Q/m) Ber? Fe k? efx 
(22)? (efKs + 1)? alles (eEke rf 12° 


Summarizing, we see that Eq. (10.79) leads to: 
J = -NVeer, 
where VW, = effective density of superconducting electrons = W — SW ,; 


N= 2 | U{ = numerical density of electrons, 
2B ( d*k — k?ebKe 
(2n)* (ef + 1)?” 


Nn = 


v= =. (10.80) 


As the temperature tends to zero, B approaches infinity, ,y;, approaches 
zero, and 7, tends toward 4~ 4, comes from the excitations, and can be 
considered as a number density of normal (nonsuperconducting) electrons. 

To help us see how ;, behaves, let us make some approximations. 
Suppose that the integrand of 4%, is appreciable only near the Fermi surface in 
k space. This assumption is a consequence of the assumption that f,,9 con- 
tributes only near the Fermi surface, as in Fig. 10.23. Then we can replace k? 
with k2, and 


M (0) de, 

= ; lc ” 

where (0) is the density of states ate = 0. We have 
4nk? dk _ 4nkmkdk _ 4nkm 


M(e) de = 
(2n)> (2x)? m (2x)3 


de. 
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Therefore 
M(0) = 4nkpm 
(2)? 

Also 

_ 4 ake 

3 (2z)3 
The factor of 2 comes from spin. 
Putting all this together yields 
[7 ae iz BA bavi tx? 
ee ac (ef 4. 1)? ‘sie (oPAVI +? ye (BA). 


(10.81) 
Here y is a function defined as 


cd ezvitx? 


We already know that at absolute zero, y = 0. At the critical temperature, 
A = 0, £, = e,andy = 1. Inother words, v, = Oat the critical temperature, 
and there is no superconducting current. j; is given qualitatively as a function 
of temperature by Fig. 10.24. 

Without much effort we can generalize this derivation. Suppose that instead 
of simply minimizing F, we minimize F subject to the condition that there is a 
fixed expectation value of the momentum. We then do not get a stable state, 
but instead a state that will decay into a stable one. An outline of the procedure 
follows. 

Where before we minimized F, we now minimize F + V,-<¢P)>. The 
components of V,, are Lagrange multipliers with the dimensions of velocity. 


Ns 


N 


Fig. 10.24 Number density of superconducting electrons Ns, as a function of 
temperature. 
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We fix the value of (P» at the beginning of the problem, and at the end of the 
problem we adjust V, so that (P) has the right value. The result is 


2¢ ak 7 k 
m J} (2x)° exp {[E, + k-(V — V,)|B} + 1 


where V = Q/m. We have already worked with the integral in this equation, 
and it is easy to see that 


j= —-MeV + N,V — V,) = -N eV — NV ,eV,. (10.82) 


j=—wev 


Another result obtained using the same ideas is 
Energy (at fixed entropy) = 4.1 mV? + 41 ,mV?. (10.83) 


It can be shown (with some difficulty) that V,, is the velocity of wave packets in 
the superconductor. 

Equations (10.82) and (10.83) can be summarized by the following slightly 
nonrigorous statement: 


In a superconductor there are two types of electrons—normal electrons, 
which carry currents that decay due to resistance, and superconducting electrons, 
which “‘short circuit’? the current carried by normal electrons. 


This statement is misleading because sy, and y;, do not really count 
electrons. _;,V is defined in terms of a certain integral, and jj; is given by 
N — WN; Ina similar manner, liquid helium can also be thought of in terms of 
a two-fluid model, and here, too, the model is satisfactory if used carefully, 
although it should not be taken too seriously. 

10.12 ANOTHER POINT OF VIEW 


Schrédinger’s equation can be found from a variational principle of the form 
6 \- d*x dt = 0, (10.84) 


where the Lagrangian density is 


LW, U*, WW, WU", w*) 


Pll [tp ome tes 
miti c i € 
(10.85) 


Finding w and * such that { Y d*x dt is an extremum is equivalent to solving 


the equations 
dL/dp = 0 (10.86) 
5L/sy* = 0 
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where, for example, 


6 02 dd ed 08 

oy Gh dt G1 dx, AOpjex,)- 
Using Eq. (10.87) with Eq. (10.86), we get Schrédinger’s equation and ity 
complex conjugate. 


(10.87) 


If we write y = Vp e®, we can consider ¥ as a function of p and @ and 
their derivatives. Then 


OL - 6L Op bf oy* 
dp oy 6p  dw* ap 


0 (10.88) 


and 
bL/60 = 0. 
In terms of p and 6, 


1 gA\? hh? = ; oA. 
L= pf Avo — %) + — (Wp): (Vvp) + QVp + pb + — fp. 
2m c 2m 2i 


It follows that 


imu? + qvV + U = —hi, 
(10.89) 
V-pv+ p=90, 
where 


v = (I/m\(hVO — gAlc) and =U = —(A2/2m)I|/Vp V2Vp. (10.90) 
Incidentally, 


ima Ge Eye + [Ge - Sy ft = 
2m i c i c 


is the probability current, not the electric current. 

Now, how does all this apply to superconductivity? In superconductors. 
the electrons in the ground state come in pairs with opposite spin. The linear 
combination of such pairs with the lowest energy can be considered a particle. 
Since the spin is zero, the “particle” obeys Bose, statistics, and the ground state 
of a superconductor has huge numbers of “particles” in the same state. If p is 
such that y is normalized, Eq. (10.89) describes the motion of a single ‘“‘particle,”’ 
but because the “particles” are bosons, any number can be in the same state 
and p can be the number density. 

Note that ¢g = 2e and m = twice the effective mass of the electrons. Note 
also that if p is the number density, we are obliged to modify Y in some way SO 
that p does not deviate too much from some value characteristic of the material. 
say p,. We must add to¥ a term 


a = 2 
5 Ps)’ 
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a does not come from electrostatics, for the electrostatics specifies only that 
N; + N, = constant, where “7, = p. We will not discuss x any further. 
Equation (10.89) can be replaced by the equations 


Vyja? S0, 
ot 
mo + (v- ce x B) + VU, (10.91) 
ot 
where 
Be syst. -#S Vea 
c Ot 


Equations (10.91) are merely the equations of hydrodynamics. The first 
equation is the equation of conservation of the fluid, and the second is F = ma, 
with U interpreted as a pressure. Because of the «/2(p — p,)? term in Y, U is 
no longer given by Eq. (10.90). Just as Maxwell’s equations describe the motion 
of many photons in the same state, Eq. (10.91) are macroscopic equations for 
many superconducting ‘‘particles” in the same state. 

From Eq. (10.90), mv + qgA/c = hVO@ is a gradient; therefore its curl is 
zero. If B = V x A is zero, V x v = 0 and the fluid is irrotational. From 


Eq. (10.90), 
2 
ee ee | (me + 4) as, 
‘ c 


If point | = point 2, we get 


A(2nn) = (me + 14). as = 1 o("s" + A) a 
c c q 
=4 bias + A)-ds, 
c 
where n is an integer and A’ = mce/qp, = —2m,c/2ep,. 
_2nhe | -7 2 
(A‘j + A) = ar x —(2.09 x 107’ gausscm*)n. (10.92) 


Hereafter, replace n by —n. 
For a simply connected superconductor (i.e., one without holes) V and A 
are continuous everywhere; son = 0. It follows that A = —A‘j.* But suppose 


* This equation is the same as Eq. (10.2). Remember that the j of Eq. (10.2) is the 
charge current, not the probability current as above. 
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the superconductor is in the form of a ring. The field does not penetrate very 
deeply into the ring (see Section 10.1), so there is no current deep in the ring. 
For a path of integration near the middle of a reasonably thick ring, 


2.09 x 10°7n = bens + A): ds = 4 ‘ds = ig dS. (10.93) 


Equation (10.93) predicts that the flux through a ring is quantized, and this 
prediction is confirmed by experiment. Furthermore, if g were e instead of 2e, 
the coefficient of n would be twice as large as experiment gives. 

There are some other, very interesting, applications of the point of view 
introduced in this section. Consider an insulator between two identical pieces 
of superconductor (Fig. 10.25). 

If Ax is sufficiently large, the two superconductors do not affect each other, 
and 


h, 

5 Y= Eww, 
h. 

_ YW. = E,W. 


E, can differ from £, if there is a voltage across the insulator. If Ax is very 
small, the phenomenon of barrier penetration allows some of i, to leak into the 
region of w,, and vice versa. Since Schrédinger’s equation is linear and homo- 
geneous, the rate of leakage into region | is proportional to Ww. and vice versa. 
It can be shown that 


ors Ww, = Ey, + ao, 
(10.94) 
ei Wo = E,W, + a*y, 


where a is real if there is no vector potential. If there is a vector potential, 


‘ance (igA/fic) > Ax 
a, = a,e A 


Superconductor 
vy 


Superconductor 
¥, 


}+——— ax 


Fig. 10.25 An insulator, width Ax, between two identical superconductors. 
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We will take a real, and leave the more general case for the reader to work out. 
If we write w = Vp e’®, Eq. (10.94) yields 
rn ae id Be , 
Eat hop, = E,v'p, + av'pp exp [i(8, — 0,)] (10.95) 
Pr 


sei?? 


and a similar equation with 
Eq. (10.95) we find that 


and “2” exchanged. Equating real parts of 


ho, =a je cos (0, — 0;) + Ey 
Pi 


and exchanging ‘‘!” and ‘‘2”’ gives 


hb, =a ae cos (0, — 0,) + Ep. 


P2 
Similarly, 
: PY ay acme : 
j= a> V psp sin (0, — 9) = —o. 
j = —/p, = the current that flows through the junction. Define 


2a j= _ 2ap, 
5=(0,-9,), jo = V0.0. > SP. 


If p, © p2, we get 


J = Jo sin }, 
dd _ E,— Ey _ qv _ 2eV (10.96) 
dt h h h? 


where V is the voltage across the junction. 
Now, suppose we construct the device shown in Fig. 10.26. If6 = 0, — 0, 


6, 82 


bo. Junctions meer 


9 8 


Fig. 10.26 A two-junction device. 
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and 6’ = 6, — 6}, then 
Jot = current that flows freely through the device = jo(sin 6 + sin 6’). (10.97) 
Inside the superconducting part of the loop V@ = (1 A)(mv + (qA/c)), so 


2nn = change in @ as we pass around the loop = 6’ — 6 + , (me + = “ds. 
c 


But if the wires and junctions are thick enough, »v is small, and 
mn = 5'-5+ 4 A-ds = 85 -—54 1%, (10.98) 
he he 
where g = § B- dS is the flux through the loop. 
Equation (10.97) can be graphically represented if we notice that 
rot = the imaginary part of jo(e® + e), 


In the complex plane, 7 behaves as shown in Fig. 10.27. A small voltage can 
cause 6 and 6’ to change with time, but 6’ — 6 is a function only of the flux 
through the loop. With a given flux through the loop, the maximum value 
Jeo C2N have is by simple geometry 


J max = io cos @ > 9) = 2jo COs (10.99) 


c 

We predict J,,,, to vary periodically with g as shown in Fig. 10.28. The distance 
between successive maxima represents a vary small flux (of the order of 107 ' 
gauss cm). Something like the graph of Fig. 10.29 is actually observed. 
Figure 10.29 differs from Fig. 10.28 in three respects: 


1. the scale is different; 
2. as ~ increases, the oscillations die out; 
3. a nonzero lower bound arises for the oscillations as ~ increases. 


The above experiment is analogous to the double-slit experiment in optics. 
In optics, if the slits have finite width, the intensity must be multiplied by the 
diffraction pattern of a single slit, and for large angles the oscillations die out. 
In our experiment, there is some magnetic field passing through the junction 
material, so 6 and 6’ are not uniform along the junctions. The spreading of 6 
along the junction corresponds to the “‘width”’ of the slit in the optical double- 
slit experiment, and it has a similar effect. 

The nonzero lower bound that appears as the oscillations die out is 4 
consequence of the fact that v, the velocity of the electron pairs, is nonzero. 
The physical apparatus as pictured in Fig. 10.30 does not really have its left 
side identical to its right side, so $ v d, need not be zero. 
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Teor § 


Fig. 10.27 Current jg in the complex plane. 
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—-+ ——_— ¢ 


Fig. 10.28 Theoretical periodic variation of maximum current jmax. 


Imax 


——» > 


Fig. 10.29 Experimental variation of jmax. 
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If the magnetic flux through the loop is produced by a solenoid through the 
loop, the oscillations do not die out because no magnetic field passes through 
the path of the current. The fact that varying @ does cause j,,,, to vary shows 
that it is the vector potential, A, that directly affects the motion of the electrons 
The B field was invented in order to describe the motion of objects in terms of 
something that exists at the position of the objects. Here the vector potentia] 
serves this purpose. 

We can run a current through the bottom film of superconductor in the 
device pictured in Fig. 10.31. If the bottom film is doubled on itself no magnetic 
field is produced in the loop. But now we have added a term 


m m m 
— | v-ds = — |j-ds = —— j- ds 
h | hips } : hgp, | - 


to the phase, 6 — 6’. 

The integration path is along the bottom film of superconductor. By 
measuring the electric current needed to produce a given shift in the position 
of the maxima of the oscillations, we can find m/fip,g. Recall that A = p,gq/m. 

Another experiment we can perform is to rotate the device pictured in 
Fig. 10.30. If b is a vector, 


db\ | (2) 
ge ™ \ a0 Jas measured in a system 
dt rot dt rotating with angular velocity @ 


So, for example, if the position of a particle is called r, 


dr 
Vor ={--) =v-@xr. 
(so Ge , 


Gor = eo = dy) — @ X (W)rot 
rot 


dt dt 
a 
dt 


a—@ xX [(v)o + @ xX r] — @ X ()rot 


dr 
—@X — — @ X (®),6 
at (¥ rot 


a+ 2AV)o X O - @ X (M@ x Fr). 


Neglecting second-order terms in w, we see that in the rotating frame there 1s 

an added effective force of 

GW) yg 2M 
c q 


that is, an effective magnetic field B = 2mac/q. 


2M )rot X O = 
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el 


Fig. 10.30 The “‘double-slit’’ apparatus. 


Fig. 10.31 The same, with the bottom film doubled on itself. 


Going back to quantum mechanics, we see that this effective magnetic 
field causes an effective A to appear in the Hamiltonian. In our experiment 
with the rotating loop, the maximum current varies as the angular velocity is 
varied. By means of this experiment we can measure g/m. 

Devices related to the one described above may have many uses. They 
react so quickly that there are rumors that one can detect radio signals by the 
vibrations of the magnetic fields. More likely, such devices might be useful for 
Measuring small magnetic fields simply by counting the maxima of the current 
oscillations. Perhaps they could be used to maintain fields (or rather fluxes) 
constant to a high degree of precision. Maybe they would be useful for informa- 
tion storage in computers. There may be circumstances in which they could be 
used as a strain gauge, by keeping B fixed and detecting changes of area. 


CHAPTER I1 


SUPERFLUIDITY 


11.1 INTRODUCTION: NATURE OF TRANSITION 


Liquid helium* undergoes a transition at 2.18°K, which may be demonstrated 
in a spectacular manner. When normal liquid helium, known as Hel, at the 
boiling point is cooled to 2.18°K, the boiling abruptly stops and the quiescent 
low-temperature modification, liquid helium JI, (Hell) appears. Hell does 
not boil. This abrupt transition also can be demonstrated by specific-heat 
measurements (see Fig. 11.1). The shape of the specific heat curve resembles a 


¢ 
4 


i | | wie ~ T (°K) 
I 2 3 4 b 


Fig. 11.1 Specific heat of helium near the transition temperature. 


Pressure 


wm 7 (°K) 


Fig. 11.2 Phase Diagram of helium near the transition temperature. 
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“2” and as a result the transition temperature 2.18°K is often called the ‘‘A- 
point.” The specific-heat curve on both sides of the A-point is given empirically 
by (see 1.9.) 


¢, = Boop ti T= Ti: 
a 


<J 


where a, is the constant relevant for T > T, (Hel) and a. for T < T, (Hell). 
This result has not been obtained theoretically and remains one of the unsolved 
problems of superfluidity. At very low temperatures the specific heat is propor- 
tional to T?, a relation that is understood. 


Properties of Liquid Helium: Liquid helium below the j-point, that is, Hell, or 
“superfluid” helium, exhibits several remarkable properties. 


1. As has already been noted, Hell does not boil (although there is evapora- 
tion). This can be explained by assuming the heat conductivity of Hell to be 
essentially infinite. 


2. More remarkable than the apparent infinite heat conductivity is the zero 
viscosity (under certain conditions) or superfluidity of Hell. It has been shown 
that below a critical velocity V,, Hell flows through a thin capillary or ‘‘super- 
leak” with zero resistance. Furthermore, V, increases as the capillary diameter 
decreases. Observing flow through a capillary is not the only way to measure 
viscosity, however. If a cylinder is placed in a liquid helium II bath and rotated, 
there is a momentum transfer from the rotating cylinder to the helium, indicating 
that under the conditions of this experiment, the viscosity is not zero! These 
viscosity experiments can be crudely explained if it is assumed that Hell is 
composed of an intimate mixture of two fluids—one fluid with zero viscosity 
and density p,, and the other with normal viscosity and “‘density”’ p,.* Thus it 
is the zero-viscosity component that flows through capillaries and the normal 
component that interacts with the rotating cylinder. To explain the observed 
data, we must assume that p,/p, is a function of the temperature. 


3. A third remarkable property of Hell is the ‘‘thermomechanical” or 
“fountain effect,” together with the related ‘“‘mechanocaloric effect.” 


Consider two containers of Hell connected by a superleak (Fig. 11.3). 
Let the density, p, and temperature T be kept constant on each side. Then 
there will be superfluid flow through the superleak until the pressure head AP 
is equal to AP = ps AT. s is the specific entropy. This temperature difference 
Biving rise to a pressure difference is known as the thermomechanical effect. If 
One container is a thin tube the pressure head will cause a fountain of liquid 
helium (Fig. 11.4). 


* The reason for the quotation marks about “density” will become evident later. 
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Fig. 11.3. The thermomechanical effect. 
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Fig. 11.4 The fountain effect. 


If the containers of Fig. 11.3 are kept at a constant pressure and there is 
mass flow from A to B, container B will cool. This is the mechanocaloric 
effect. If the two-fluid concept is extended, so that now it is assumed that the 
superfluid component not only has zero viscosity but also carries zero entropy, 
the thermomechanical and mechanocaloric effects can be explained both qualita- 
tively and quantitatively. For example, the cooling of container B in the 
mechanocaloric effect is exactly what one would expect if the mass transferred 
had zero entropy and hence zero temperature. 

The superfluid flow of a zero-entropy, zero-viscosity fluid also explains the 
abnormal heat conductivity of Hell. 

Most of the other “superfluid” properties of Hell can be explained in terms 
of the properties described above. For example, liquid helium II placed in a 
beaker will creep up the surface of the beaker and spill over the sides. This will 
continue until the beaker is empty (it being assumed throughout that the tem- 
perature of the helium is always below the J-point). This rather peculiar behavior 
can be explained by the ordinary physics of evaporation and the infinite thermal 
conductivity and zero viscosity of Hell. 

If any normal liquid is placed in a beaker we would expect that, due to 
Van der Waals attraction between liquid and beaker molecules, a layer of 
liquid, whose thickness decreases with height, will be formed on the beaker 
walls. In all liquids but Hell, however, the layer formation is inhibited by small 
but finite temperature, and hence vapor pressure, differences between the wall 
and the liquid. Thus, depending on whether the wall is warmer or colder, there 
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will be either rapid evaporation from the wall layer or the formation of droplets 
that will fall back into the liquid. In Hell, however, the abnormal heat con- 
ductivity prevents the establishment of any temperature differences, and as a 
result the layer creeps up the side and over the top. 


Landau Two-Fluid Theory: We have seen that both the viscosity experiments 
and the thermomechanical effect can be explained (crudely) by a two-fluid model 
for liquid helium II. A phenomenological theory using the two-fluid concept 
was introduced in 1940 by Tisza* and in another form in 1941 by Landau.} 
Since Landau’s view is the deeper one, we shall concentrate on his formulation. 

Consider Hell to consist of a perfect background fluid (which has zero 
entropy and viscosity), and some type of excitations, which for the moment 
may be regarded as phonons. This simple hypothesis explains a great many of 
the superfluid properties. (The model is somewhat analogous to a solid con- 
sisting of a background lattice plus phonon excitations.) The excitations accord- 
ing to the Landau view are the normal component. 

First, the specific heat of a “phonon gas” is proportional to 7 at low 
temperatures in agreement with Fig. 11.1. Second, as the perfect background 
fluid flows through a superleak the phonons are inhibited because they collide 
with the walls. Thus the perfect fluid emerges with no excitation and hence 
zero entropy. The two-fluid model is further strengthened by the following 
ingenious experiment due to Andronikashvili.t A pile of closely packed discs is 
rotated ina Hell bath. The superfluid component is unaffected, but the phonons 
(and any other excitations) are dragged around with the discs and have ar 
inertial effect, which can be measured. In this way the “density” p, of the 
inertial or normal component can be measured. (Fig. 11.5). 

The two-fluid concept suggests that the two components can oscillate out 
of phase in such a way that the total density of HelI at a point is essentially 
constant but the difference or ratio of the superfluid and normal components i: 
not. The density of excitations, however, is a function of temperature, and hence 
as p,/p varies so must the temperature. This leads to a new type of wave propaga: 
tion known as “‘second sound.” Second sound is a temperature wave and wil 
be excited by heat rather than pressure pulses. 

According to the Landau view, one can think of second sound as a density 
wave in a phonon gas. Ifthe phonon velocity near T = Ois c (c is, of course, the 


* Some of Tisza’s ideas can be found in his following papers: Nature, 171, 913 (1938) 
J. Phys. et Radium, (8) 1, 164, 350 (1940); Phys. Rev. 72, 838 (1947). 

+L. D. Landau, J. Phys. USSR 5, 71 (1941). See also: L. D. Landau and E. M 
Lifshitz, Fluid Mechanics, Addison-Wesley, Reading, Mass., 1959, ch. XVI; anc 
L. D. Landau and E. M. Lifshitz, Statistical Physics, Addison-Wesley, Reading, Mass. 
1959, ch. VI, §§66-67. 

+ E. L. Andronikashvili, J. Phys. USSR 10, 201 (1946); Zh. Exsperim. i Teor. Fiz. 18 
424 (1948). 
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Fig. 11.6 Velocity of second sound. 


velocity of first sound), the ordinary theory of sound propagation predicts that 
the velocity of second sound c, = clV3 as T+ 0. (See also Section 9.2.) 
Tisza associated second sound with the superfluid component rather than the 
normal (phonons), and as a result he predicted that c > 0 as TJ > 0. Fora 
while there was a controversy on this point, but experiment soon proved Landau’s 
supremely confident prediction that c, > cl/V 3 to be essentially correct. (Actually 
Cc, > clV3 at very low temperatures, less than 0.5°K.) The actual curve of 
second sound vs. temperature is given in Fig. 11.6. 

To explain this curve between the J-point and 1°K, it must be assumed that 
there are other excitations besides phonons. Landau derived the excitation 
curve of Fig. 11.7 empirically; it has since been developed theoretically. At the 
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Fig. 11.7 Excitation curve for phonons and rotons. 


lowest temperatures, the mean free path of the phonons becomes large; so it 
becomes difficult to measure the speed of second sound. 
At low temperatures, E(P) = cP and the excitations are phonons. In the 
region around Po, 
(Ps=2P5) 


E(P)=A+4+<——“ , 
2" 


where A is a constant and yp is some effective mass. Landau called these excita- 
tions “rotons.” 

The two-fluid model of Landau can be put on a more quantitative level by 
considering the statistical mechanics of a phonon-roton gas. 

The free energy is 


3 
apy 


F = kTin(1 — e7 Peper 
( ) Comhy3 


(11.1) 


It is clear that the low-energy excitations (phonons) and the excitations around 
Pp (rotons) contribute the most to the integral. Well below 1°K, however, the 
contribution due to the phonons dominates. Above 1°K, the rotons dominate. 
The average energy and specific heat due to the phonons (FE = Pc) is 
found as follows: The expectation value of the number of phonons of energy E is 


you ne) 1 1 
© g~ BinE) 7 eBE _ | ePclkT __ |” 


n= 


oo 2 5).47-4 5,473 
Bw = ( Pc 4nP* dP 4r°-k*T* | _ 16n°k rT (11.2) 
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The specific heat due to the rotons (E = A + (P — Po)?/2y) is 


_ pt? P2A? 5 as) AS Oe al 
Cro = (ny 2232 AS 1+ ode ret ee (11.3) 


To understand the Andronikashvili experiment and the meaning of p,, we must 
consider the statistical mechanics of a moving phonon-roton gas. Perhaps the 
simplest way to proceed is to assume that the superfluid background component 
is moving with velocity V,. Then the energy of an excitation, phonon or roton, 
is given by 

E= E(P) + P- V,. (11.4) 


A similar equation was derived for superconductors. An easy way to see this 
relation is as follows. Suppose a gremlin floating in the fluid gives it a kick and 
produces an excitation of momentum P. Then the velocity of the rest of the 
fluid is decreased by P/M, where M is the mass of the background fluid. If 
the fluid winds up with velocity V, plus the excitation, it must originally have 
had velocity V, + P/M, along with the energy necessary to create the excitations. 
The total energy must therefore be 4M(V, + P/M)? + E(P) = 4MV? + 
P-V, + E(P). The energy the fluid has if it moves at velocity V, without an 
excitation is }MV?. Therefore the energy necessary to excite the fluid without 
changing its velocity is E(P) + P-WV. Now, applying ordinary statistical 
mechanics, we find that the expectation value of the number of phonons of 
momentum P is 
1 


N, = : 
PT QE(P)+PWa)/kT 


(11.5) 


The total momentum density is evidently given by pV, + (P) where 
P a°P 
(P) = lepers 1 nh)? 


_ _[P(PsV[kTeR*™ d?P 
(eF/kT _ 1)? (2nhy? 


= —9,V,. (11.6) 


+ higher order in V, 


By definition 
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In other words, from the point of view just adopted, p, is a derived con- 
cept and is not the density of anything. It is clear from Eq. (11.6.) that 
(p — p,)V, = p,¥, where p, is the density of superfluid. 

As in Section 9 of Chapter 10, we can consider the case of an unstable 
state by minimizing E ~ TS, subject to the condition that there is a fixed 
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expectation value of the momentum. In other words, now the excitations can 
be forced to drift. The result is 


N ] 


P eALE(P)—P Vat PVs] _ 4 


and (<P) = —p,(V, — V,). 


11.2 SUPERFLUIDITY—AN EARLY APPROACH 


We saw in the previous section that the Landau two-fluid theory and the 
empirically derived excitation curve (Fig. 11.7) explain a great many of the 
superfluid properties of liquid helium. Thus, a deep understanding of super- 
superfluidity can be obtained by a theoretical deduction of the excitation curve 
and of course, the transition. In particular, we must explain: 


1. Why there are so few excitations at low energy—which is the central 
feature of superfluidity. 


2. Why the excitation curve has its particular form. 
3. Why there is a transition, and 
4. The quantitative nature of the transition. 


The Transition: Proof that superfluidity is not explained by quantum-hydro- 
dynamics. Although the answer to why there is a transition has been known 
for some time,* this problem has a rather curious history. The Einstein-Bose 
condensation (Section 9 of Chapter 1) predicts that an ideal Bose gas at the 
density of liquid helium will undergo a sharp transition at 3.2°K, which is 
remarkably close to the A-point, 2.18°K. Liquid helium* obeys Bose statistics, 
but is of course not an ideal gas. Nevertheless, it had long been felt that the 
A-point transition is some sort of Bose-Einstein condensation, and as will be 
shown later, this is indeed the case. 

However, a seemingly alternative explanation of the transition was supplied 
by Landau in 1941. Landau, using a procedure termed the “quantization of 
hydrodynamics,” developed a set of commutation relations that seemed to suggest 
that there were an energy gap, phonons, rotons, and so forth. These were never 
deduced by Landau. If, however, it is assumed that the quantization of hydro- 
dynamics gives rise to the excitation curve, it is not unlikely that all of super- 
fluidity including the transition can be explained by ‘“‘quantum hydrodynamics.”’ 
Furthermore, since quantum hydrodynamics has nothing to do with statistics 
(the way Landau developed it), the validity of Landau’s approach would imply 
that *He as well as *He undergoes a transition, and the Bose condensation is 
not the cause of the transition. 


* Feynman, R. P., “Application of Quantum Mechanics to Liquid Helium” in 
Progress in Low Temperature Physics, Vol. 1, edited by C. J. Gorter (1955). Dingle, 
R. B., “Theories of Liquid Helium II,” Advances in Physics 1, No. 2, p. 111 (1952). 
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Thus, for a while there were two alternatives purporting to explain the 
liquid-helium transition. The refusal of *>He to undergo a transition as lower 
and lower temperatures were attained perhaps threw doubt on the Landau 
hypothesis. However, I think I have proved that quantum hydrodynamics 
does not predict the excitation curve as had earlier been hoped by Landau and 
others. An outline of this proof is as follows: 

We consider a fluid and adapt the Lagrangian point of view. That is, we 
describe the state of the fluid by giving the displacement R(rg, ¢) of each particle. 
ro is the position of the particle at time tf = 0 and has the effect of labeling or 
distinguishing it. Each particle obeys quantum mechanics, and since the 
particles are distinguishable, quantum statistics need not be considered. It is 
then possible to find a set of commutation relations, and to show rather easily 
that there are a tremendous number of low-energy excitations (i.e., there is no 
energy gap, etc.). Now it is possible to convert the commutation relations from 
the Lagrangian to the Eulerian point of view. In the Eulerian point of view, the 
density, velocity, and so on, are functions of r and ¢ where r is a fixed point in 
space. The result is that the Eulerian commutation relations so obtained are 
precisely those obtained by Landau! 

Thus, according to Landau’s quantum hydrodynamics there are a multitude 
of low-energy excitations, and hence this theory does not explain superfluidity. 
Landau’s error is that by tacitly assuming the particles to be distinguishable, he 
neglected the effects of quantum statistics. 


Theory of Superfluidity: We now want to explain superfluidity from first 
principles, given a collection of N He* atoms at density py. The two-body 
interatomic potential is given in Fig. 11.8. It is clear from this figure that the 
helium atoms resemble hard spheres of diameter 2.7 A. Since at normal density 
there is 45 A® per atom the atoms are not tightly squeezed. 

The first question one may ask is, why is He* still a liquid at T = 0, at 
zero external pressure? If we calculate the potential energy of a helium lattice 


Potential (°K) 
° 
y 


Atom separation ( A) 


Fig. 11.8 Two-body interatomic potential. 
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and add the zero-point kinetic energy, we obtain the total zero-point energy, 
which is approximately the correct value. However, the kinetic or oscillatory 
energy is so large that the oscillatory amplitude is of the order of a lattice 
spacing. Thus the lattice ‘“‘melts’’ due to the lattice vibration. With the exception 
of He®, no other substance has such a high ratio of zero-point kinetic energy to 
zero-point energy. This is because helium is very light (and hence has high 
natural frequency) and has a saturated outer shell, causing very weak interatomic 
forces. 


11.3 INTUITIVE DERIVATION OF WAVE FUNCTIONS: GROUND STATE 


We now turn to the central feature of HelII, which is the scarcity of available 
low-energy excited states. Rather than look at the Hamiltonian we shall 
“wave our hands,” use analogies with simpler systems, draw pictures, and make 
plausible guesses based on physical intuition to obtain a qualitative picture of 
the solutions (wave functions). This qualitative approach will prove singularly 
successful. 

Consider the ground state g(R,, R2,..., Ry), where R; is the radius vector 
of the ith atom. Each set of 3N numbers (N radius vectors) will be called a 
“configuration” and can be represented by N points in 3-space, the N points 
representing the positions of the N helium atoms (Fig. 11.9). (For ease of 
illustration, the figure is, of course, two dimensional). For each configuration 
there is an amplitude or number ¢, which is high for probable configurations 
and low for improbable ones. We may describe the difference between two 
configurations, A and B, by saying that the atoms ‘“‘move’’ from configuration 
A to B. The word “‘move” does not imply any dynamics. 

Now, what can be said about g? First of all, because He* is a Bose- 
Einstein liquid, @ must be symmetric. That is, an interchange of two particles 
leaves g unchanged. Secondly, in analogy with simpler Bose-system stationary 
states, ~ is real and has no nodes. That is, g is always positive. 
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Fig. 11.9 A configuration of helium atoms in 3-space. 
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To see that @ can be taken to be real, consider Schrédinger’s equation for ~: 
—h? 
| ¥v7 + ¥ ve | o = Eg. 
2m 7 i<j 


Clearly, g* satisfies the same equation; so either ig is real or g + 9* is a 
nonzero real solution to Schrédinger’s equation energy E. 

If there are several degenerate eigenfunctions, first pick one solution, use it 
to construct a real eigenfunction, g,, then consider the space of degenerate 
eigenfunctions orthogonal to g, and repeat the procedure. In this way we can 
obtain a set of real eigenfunctions with energy E. 

To show that @ does not pass through zero if its energy is as low as possible, 
we will show how to find a function of lower average energy [ey when a wave 
function ~ with nodes is given. Since |x) is the lowest possible in the ground 
state, it will show that @ is not the ground state, and therefore that the ground 
state has no nodes. Suppose we fix all the arguments of g except ¢, and let 
P(X1, X2,..., &...) = @(€) look like Fig. 11.10a. Then construct a new wave 
function that looks like Fig. 11.10b. This wave function has the same value of 


and since the energy may be expressed as 


N 2 24qN 
{E) = LeHed R = f [1/2m Di (Vig) + ¥ Vo Jd R (11.8) 
Jo? ad"R fo? dR 
our new wave function has the same energy as the old. We can now lower the 
energy by smoothing out the wave function (see Fig. 11.11), that is, we can 


{a) (b) 


Fig. 11.10. To show that a solution with lower energy can always be found, given 
a wave function with nodes. 
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Fig. 11.11 A smoothed version of Fig. 11.10b. 
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drastically decrease (Vg)? near the node without increasing the y” very much. 
It follows, by the way, that the state of lowest energy is nondegenerate. If 
it were degenerate, we could, by appropriate choice of c, and c,, make a wave 
function 
YW = C19, + C22 


that has nodes, and therefore does not have the lowest possible energy. Thus, 
we can take the state of lowest energy to be real and everywhere nonnegative. 
Figure 11.8 indicates that the helium atoms are almost hard spheres. Thus 
any configuration (e.g., Fig. 11.9) that includes two overlapping atoms gives a 
very small amplitude—call it zero. From Eq. (11.8) we see that the energy is 
high where there are large gradients in g. Thus we want the ground-state wave 
function @ to vary as slowly as possible. Now consider in Fig. 11.12, the atom 
® to “move” about, while the other atoms remain fixed. g for configuration (b) 
must be very small; otherwise there would be a high gradient as ® moves the 
small distance from (b) to (c). In other words, configurations where two atoms are 
very close are unlikely, and in the ground state the most likely configurations are 
those where the atoms are approximately evenly distributed (Fig. 11.12a). 

Thus we have deduced that g is symmetric, real, always positive, largest 
when the atoms are evenly spaced, and lowest when there is clumping. An 
expression that satisfies these requirements is: 


9 = exp | -E 70] = TF) (11.9) 


where r;, is the distance between the i and j atoms, and F is given approximately 
in Fig. 11.13. Fis occasionally approximated by some simple function that fits 
Fig. 11.13, such as F = 1 — a/r;;. Since ¢ is expressed in terms of the one func- 
tion F, it is natural to ask why a variational principle to determine F is not used. 
The answer is that the integral over all the 3N variables cannot be expressed 
simply in terms of F. 
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Fig. 11.12 a, b, andc. A configuration in which one atom “moves” about while all 
the others remain fixed. 
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Fig. 11.13 The function F of Eq. (12.9). 


Note that the nondegeneracy of the ground state implies that it is a sym- 
metric state for any many-particle system, when the particles are identical. For, 
since the Hamiltonian is invariant under a permutation P of the identical 
particles, then g(PR) will also be a ground state when g(R) is. But since there 
is no degeneracy, we must have 


(PR) = a(R), 


and since ~(PR) is always positive and real (all @ values are) we have « = 1. 
Thus ¢ is totally symmetric. (Remember that this function might yet not be 
allowed by statistics.) 


Excited States: Explanation of Superfluidity: Again, in the analogy to simpler 
Bose systems, the first excited state wave function, w, must have one node and 
be symmetric. Since ¢ is always positive and [ py d"R = 0, w must be positive 
for half the configurations and negative for the other half. We now come to 
the key argument of superfluidity: The on/y low-energy excitations are phonons. 
Phonons are excited states of compression, or states involving small displace- 
ments of each atom with a resultant change in density. For the phonons to 
have low energy, the density fluctuations must be over large distances; in other 
words, the phonons must have long wavelengths. The central problem is to see 
why no states other than these phonons can have such low energies. 

Now w, which we assume is not a phonon state, must be orthogonal to each 
phonon state as well as to ¢. This means that y must vary from plus to minus for 
configuration changes that do not alter the large-scale density. If y is to have 
low energy, this change from plus to minus must be very slow, that is, it must 
involve large-scale ‘“movements” of the atoms. But the fact that the particles 
are indistinguishable and obey Bose statistics makes such a movement 
impossible! 
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Fig. 11.14 Two configurations (solid and dotted) that result from large displacements 
(long arrows) of the atoms, can actually be accomplished by much smaller adjustments 
(short arrows) because of the identity of the atoms. (From Reference 1). 


Now let us quote from the first reference cited in Section 11.2*: 


“The function y takes on its maximum positive value for some configuration 
of the atoms. Let us call this configuration A, and the particular locations of the 
atoms «-positions. We said that the «-positions must be well spaced, so that 
the atoms do not overlap, and further that they are, on a large scale, at roughly 
uniform density. Similarly, let us call configuration B, with atomic positions £, 
that for which w has its largest negative value. Now we want B to be different as 
possible from A. We want it to require as much readjustment over a long dis- 
tance as possible to change 4 to B. Otherwise w changes too rapidly and easily 
from plus to minus, our wave function has a high gradient, and the energy of 
the state is not as low as possible. 

“Try to arrange things so that 4 requires a large displacement to be turned 
into B. At first you might suppose it is easy. For example (see Fig. 11.14), in A 
take some atoms in the left side of the box containing the liquid and move it 
across to the other side of the vessel, and call the resulting configuration B. 
One objection to this is that an atom is moved from one side to another, so a 
hole remains at the left and an extra atom is at the right. This represents a density 
variation. To avoid this we may imagine that another atom has been removed 
at the same time from right to left, and the various holes and tight squeezes 
have been ironed out by some minor adjustments of several of the neighboring 
atoms. This movement of two atoms each through a distance equal to the size 
of the vessel, one from left to right and the other from right to left, is certainly 
a long displacement, so B and A seem very different. But they are not. 

“The atoms must be considered as identical; the amplitude must not depend 
on which atom is which. One cannot allow wy to change if one simply permutes 
atoms. The long displacements can be accomplished in two steps. In the first 
step permute the atoms you wish to move to those «-positions closest to the 


* Feynman, R. P., Ibid. 
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ultimate position they are to occupy in the final configuration B. This step does 
not change w because all the atoms are still in the same configuration of «- 
positions. Then the change to the B configuration is made by small readjust- 
ments, no atom moving more than half the atomic separation. In this minor 
motion w must change quickly from plus to minus and the energy cannot be low. 
Because the wave function is unchanged by permutation of the atoms it is 
impossible to get a B configuration very far from the A configuration. No 
very-low-energy excitations can appear (other than phonons) at all. 

“In the phonon case we consider configurations in which, as w changes sign, 
the density distribution changes. A change in density cannot be accomplished by 
permuting atoms. That is why the Bose statistics does not affect phonon states, 
But it leaves them isolated as the lowest states of the system, so that the specific 
heat approaches zero as T approaches zero according to Debye T? law. This 
is the key argument for the understanding of the properties of liquid helium.”’* 


11.4 PHONONS AND ROTONS 


We have been assuming that large-scale density variations are long-wavelength 
phonons. That this is so can be seen as follows: 

Let p(R) be the density of the helium liquid. Then, Fourier analysis of 
p(R) gives p(R) = Dx xe” '*'* and 


H = Vik + C’K7qh), (11.10) 
K 


which is the Hamiltonian for a collection of 3N independent harmonic oscillators 
(normal modes) each of frequency @ = KC. Each mode is quantized and, by 
common usage, if the Ath mode is excited to energy Ex = hKC(n + 4), we 
say that there are n phonons of frequency KC. The ground-state wave function 
for a set of independent oscillators is 


es = T] exp _1 4k = exp ly 4k (11.11) 
eK 2 KC 2“ KC| 


The low excited state of one phonon of momentum AK is given by 


1 = 1 g2 
Xk = a 2 ak ) 4 exp Gee = IxXgs- (11.12) 


2 «+k K'C 


However, 7,, is not the true ground-state wave function in the case of liquid 
helium, since it does not contain enough short distance detail. For example, 
it doesn’t tell us that two atoms cannot overlap, and so forth. Let ~ be the true 


* Feynman, R. P., Phys. Rev. 91, 1291, 1301 (1953); 94, 262 (1954). Feynman, R. P. 
and Cohen, M., Phys. Rev. 102, 1189 (1956). Feynman, R. P., Progress in Low Temper- 
ature Physics (edited by C. J. Gorter), ch. IJ, (1955). 
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ground state, which takes the short distance factors into account. Then in 
analogy to Eq. (11.12), the true low excited state Wx is given by 


Ux = Ix’. (11.13) 


We are talking about long-range density fluctuations, and K is small. Since the 
operator p(R) = >; 6(R — R;), we obtain 


qx = eceen d?R = Yeh ®, (11.14) 
and from Eqs. (11.13) and (11.14) 
W phonon 7 (x ex Q. (11.15) 
Small K i 


Actually, we should write in place of Eq. (11.10) 
p(R) = ¥ (qx sin K*R + qx cos K*R). 
K 
Then 


22 12 
H= Le = = + C°K*(qK + 4k): 
K \ 2 2 
The singly excited states are Wx = qxgand x = qx. Then the state py + ix 
also represents a one-phonon state, and is equal to (>; e'*"*)g. Equation (11.15) 
helps us to understand why it is impossible to have as a low energy non-phonon 
excitation a single atom moving through the helium fluid. For the wave function 
of such an excitation is e'“'*@, which is not symmetric. Blindly symmetrizing 
this function gives >; e’“"*, which represents a phonon excitation. It is 
possible, however, to have a (symmetrized) wave function >°, e*’*@ if 1/K is 
of the order of the atomic spacing, a. The energy associated with such an 
excitation is 
E = h? K?/2m ~ h?/2ma’, 


which is very close to the excitation energy of a roton. 


Rotons: Higher Energy Excitations: We have seen that low-energy non-phonon 
excitations are impossible. In other words, there are no possible long-distance 
movements of the atoms that do not change the density. So now let us try to 
construct a wave function w that goes from plus to minus, without a density 
change, as slowly as possible, but which of necessity involves only short 
movements. 

Again let A be the configuration for which y takes on its maximum positive 
value, and B the configuration for which w takes on its maximum negative values. 
The positions of the atoms in configuration A will be called «-positions and, in 
configuration B, B-positions. The change from A to B must involve only a small 
displacement of each atom and yet involve as large a total movement as possible. 
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Fig. 11.15 In this configuration the f-sites are between the a-sites. 


We know that the «-sites in configuration A must be evenly spaced, and the 
same for the f-sites in B. These conditions imply that the f-sites are between 
the «-sites as in Fig. 11.15. 

Suppose for a moment that each atom is either on an «- or a f-site, and let 
N, and Nz be the number of atoms in each type of position. Then 
wy = (N, — Ng)y is maximum positive for configuration A, maximum negative 
for configuration B, and passes smoothly from A to B. w can be expressed 
mathematically by defining a function f(R;) to be +1 if R; is at an a-site and 
—1 if R; is at a f-site. 


v= (Z/e) 0. (11.16) 


For an intermediate position (that is, one for which R; is between an « and 
B-site), w will vary smoothly if f(R;) is taken to vary in some smooth way 
between its extreme values of +1 and —1. 

We now want to use the variational principle to find the best possible f(R), if 


¥ =D (R)p = Fo: F = LS (R). (11.17) 
The energy integral to be minimized is 
2 ai 27 43N 
<E) ae b>; (A eT a jd R ; (11.18) 


When w = Fo, this integral can be written in an interesting way, which is 
quite general (though we take @ real and also assume q vanishes at infinity). 
Replacing y by Fo in Eq. (11.18) gives us 
2 
(E) = » fa | | IF PW9XVi9) d2°R + | (F*V,o)gV.F) dR 
7 2m 


zs | eviorocnr*) d°*R + [eer ad 


+ [ViFe® aval l [lero d°R. (11.19) 
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Integrating the first integral on the right of Eq. (11.19) by parts and making the 
substitution Hp = Eog gives finally 


= J (#7 /2m)|ViF py arR 
IF Pen d2°R 


e= (E) — (11.20) 


where 
py(Ry,..., Ry) = p(B,» +; Ry). (11.21) 


Py is evidently the probability of a given configuration. ¢ is the energy of 
an excitation, and £, is the energy of the ground state. 

Returning now to the specifics of the liquid helium problem, we have F = 
>; /(R;) and we wish to minimize Eq. (11.20) to find the best value for f. The 
denominator of Eq. (11.20) becomes 


[lries d°*R ~ a [rere il PN Il a°R,| d°R; d°R, 
7 j k#i,j 


Ul 


bs | F*(R)f(Ry)g(R, — Ry) dR, PR, (11.22) 


where 
pogR, — Rs) = VY. | 5(R; — R,) HR! — Ry)py dR’ 
uss 


Here ¢ is normalized so that { py d*R = N = number of atoms. py is the 
liquid number density in the ground state and g(R, — R,) is the probability of 
finding an atom at R, per unit volume if one is known to be R,. The numerator 
of Eq. (11.20) can be written as 


2 2 
y ie IVFl?py dR = po 3 | WARP PR. (11.23) 
r m 


i 2m 
Equation (11.20) can now be rewritten as 


__ #?/2m f IVS(R)? dR 
~ PPR SRDGAR, — R,) d?R, d?R, 


(11.24) 


By setting the variation de = 0 to minimize é we obtain the integrodifferential 
equation 
2 
~ V*f(R) = c | oR — R’)f(R’) dR’. (11.25) 
m 
The solution to Eq. (11.25) is f(R) = e*'®. 


= (= on) Q, (11.26) 
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and 
h? K? 
a= 
2mS(K) 


where S(K) = fenton d3R. (11.27) 


S(K) is the form factor for scattering of neutrons from the liquid and can be 
found experimentally (Fig. 11.16). From Eq. (11.27) it is the Fourier transform 
of g(R), which is the probability per unit volume of finding an atom at a distance 
R from a given atom in the liquid in the ground state. 

Since states of different momenta are orthogonal, the energies of Eq. (11.27) 
are significant not only for K near the minimum but also in the neighborhood 
of this value. Although in the derivation of Eq. (11.27) it has been assumed that 
K ~ 2n/a (ie., K is large) where a is the atomic spacing, the analysts is valid for 
all K as long as W = (3 f(R,))g is a reasonably good wave function. But we 
see from Eq. (11.15) that for small K, ~ = (>; f(R))¢ represents phonons, and 
is a very good wave function. Hence the results of Eqs (11.26) and (11.27) are 
valid over the entire range of K. From Eq. (11.27) and Fig. 11.16 we can 
plot e(K). 

The minimum occurs near K = 1/a, and the excitations in this region are 
the rotons. 

It is interesting to note that the excitation curve (Fig. 11.17) can now be 
determined directly, by measuring the energy loss suffered by a scattered neutron 
of incident momentum 7K. For a given angle of scattering, there will be a 
minimum energy loss, which is the energy e(K) of a single excitation. 


11.5 ROTONS 


A roton can be visualized as a group of children going down a slide. The slide 
is the AB part in Fig. 11.18. After sliding down, the children go around and 
come back to the slide again. 

The returning-flow part is not included in Eq. (11.26) of Section 11.4 for the 
roton wave function 


Y= (Set) oye (11.28) 


but is derived as follows. When we construct a wave packet out of the rotons of 
Eq. (11.28), we have a localized excitation. This localized wave packet does 
not move, because the group velocity of the roton state is zero as is seen in 
Fig. 11.17 (where de/dk = 0). This localized excitation is associated with the 
momentum AK. When we calculate the quantum mechanical current density j 
for a roton, we find it is not zero: 


j#0. (11.29) 
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Fig. 11.16 Experimental result for the form factors for scattering neutrons from the 
liquid. 
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Fig. 11.17 Experimental and theoretical excitation curves. [Feynman, Progress in Low 
Temperature Physics (1955) ]. 
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Fig. 11.18 Schematic of the movement in a roton. 
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Fig. 11.19 A wave packet of rotons. 


Fig. 11.20 Conservation of current. 


Then we have the picture shown in Fig. 11.19. We see j # 0 in a localized 
region and j = 0 outside. This contradicts the Schrédinger equation, since 
the current must be conserved. 

This difficulty is avoided by considering the slow return of current to the 
packet (Fig. 11.20). Mathematically this is achieved by improving Eq. (11.28) 
by writing 


y = (x exp l« *R; + VIR: al 1) Peround (11.30) 


and finding the best form of f(R; — R,). After the variation we find the best 
form is 


R.—R, 
= VeiRR( Lt eV —L | oo ona (11.31) 
vad Fie, — Rp) 9m 

This form conserves the current. 
As is suggested by Figs. 11.18 and 11.20, the roton is like a classical smoke 
ring, as shown in Fig. 11.21. There is one difference, though. The smoke ring 
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Fig. 11.21 A roton has the same shape as a perfect smoke ring. 


a 


Air flow ----+» 
a 


Fig. 11.22 The attraction between two opposite vortices of a smoke ring is balanced by 
its motion. 


moves, whereas the roton does not, as has already been mentioned. Because 
two vortices of opposite sense attract each other (Fig. 11.22), there is another 
important difference: In the case of the smoke ring, this attraction is balanced 
by the flow of air through it, so that the ring does not shrink to a point. For 
the roton, since it does not move, the diameter of the ring shrinks down to the 
atomic scale. 

When the atom in position A in Fig. 11.18 goes to position B, the state 
returns to itself. This leads to a condition on K in e'* ®, K is of the order of 
1/a where a is the average distance between atoms. This agrees with the curve 
in Fig. 11.20. Thus the roton is associated with a momentum and velocity of 
approximately 


p = Ala; v = h/ma. (11.32) 


When there is more than one roton, they interact. The interaction becomes 
important for high temperatures. Landau and Khalatnikov* calculated the 
attenuation of the first and second sounds from kinetic theory taking into 
account the cross sections of phonon-phonon, phonon-roton and roton-roton 
scattering. 


* For detailed discussion and references, see I. M. Khalatnikov, An Introduction to 
the Theory of Superfluidity, W. H. Benjamin, 1965. 
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11.6 CRITICAL VELOCITY 


Let us find the energy necessary to create an excitation of momentum p. 
Suppose a body of velocity V and mass M creates the excitation and winds up 
moving with velocity V’. We see by conservation of momentum 


MV = MV’ + p, 


so that the new kinetic energy of the body is 


2 
MV? = 4MV2 — Vep+ ae (11.33) 


The energy of the excitation is written as ¢(p). The excitation cannot be created 
unless 


SMV? > 4MV” + ep), 
sO 


Speen agp 
&(p) < Vip Vil V-p (11.34) 
for large M. 

The critical velocity necessary to create an excitation is then derived by 
drawing a tangent to the e(p) vs. p curve as shown in Fig. 11.23. 

The critical velocity calculated from this condition is about 60 m/sec. 

We might say that this is the reason why helium is a superfluid; it is not 
excited for small perturbations. Although this explanation seems to be 
qualitatively satisfactory, excitations occur for velocities of about | cm/sec. 
Now we have to reverse our position and look for the reason why liquid He 
can be excited with such a small velocity. 


ep) 


Critical condition, 
ep) = Vp 


Fig. 11.23. The critical velocity is at tangent to the e(p) versus p curve. 
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11.7 IRROTATIONAL SUPERFLUID FLOW 


When liquid helium is undergoing a mass motion of velocity V the wave function 
is 


wv = exp ( ‘a > << r,) P ground: (1 1.35) 


Note the difference between Eq. (11.35) and the roton or the phonon wave 
function of Eq. (11.15). The exponential part in Eq. (11.35) may be interpreted 
as follows. 


P°-Rog = nmr (d RIN) = mv + R,, (11.36) 


where P is the momentum of the whole system, Rog the center-of-mass 
coordinate, and N the number of particles in the system. 
As in the case of superconductivity, for a nonuniform velocity we assume 


y = exp |: > aR |  ground> (11.37) 
and the velocity field V(R) at R is found from 0(R) by using 
V(R) = a VA(R). (11.38) 
m 


We see from Eq. (11.38) that 
Vx V(R) = 0, 


so that when the system is singly connected there is no rotation. When the 
system is doubly connected, we can have a circulating V in the system as pictured 
in Fig. 11.24. We take 


0 = ka. 
For this choice of 6, V(R) is perpendicular to the radius and has magnitude 
hk 
IVE =-—. 
mr 


Oy 


Fig. 11.24 Circulating V in a doubly connected system. 
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k is restricted to the integers by the requirement that the wave function must be 
single valued. 


0 = ne 
jVj = ne n *(11.39) 
mr 
and 
pyas = 27 a n, 
m 


The unit of quantization is 


att = 2n x 1.5 x 107+ cm?/sec. 
m 


What is the energy for this flow? If po is the fluid density in atoms per cc, the 
kinetic energy is (for n = 


I) 
5 h 2 
KE = | pom ( ) 2nr drL, (11.40) 
- mr 


where a and b are the radii as shown in Fig. 11.24, and L is the depth of the 
liquid. Thus: 


2 
Line energy per unit length = pox us In @) = 107? In (’) ergs/cm, 
m a a 
(11.41) 


where we used py = (3.6)7> A? = 4 A®. If we neglect atomic structure and 
assume a Classical continuous liquid with surface tension, a unit line makes a 
hole. opposed by surface tension that we can calculate to be only 0.4 A in radius. 
That means that there is no real hole in the liquid. 

The energy associated with a vortex line can be estimated from Eq. (11.41). 
The lower limit a of the integral in Eq. (11.40) is considered as a length of the 
order of atomic spacing, because within about the atomic spacing the velocity 
formula is meaningless (and also the density is low near the center of the vortex). 
The exact determination of a would require us to solve the difficult quantum- 
mechanical problem. In almost all applications the ratio b/a will be very large. 
and the logarithm large enough to be insensitive to the exact value of a. For 
this reason we will not attempt a detailed evaluation, but will simply choose @ 
to be close to the atomic spacing. We may arbitrarily take a = 4.0 A. 


* The quantization of a vortex line was first suggested by L. Onsager, Nuovo Cimento 
6, Suppl. 2, 246 (1949). 
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11.8 ROTATION OF THE SUPERFLUID* 


Let us consider the state of rotating liquid helium in a can (see Fig. 11.25). 
Operationally this state is defined by starting from solid helium in the can 
(under pressure > 25 atm at 0°K). First rotate the solid helium and then release 
the pressure to melt it; the liquid helium is then rotating with the angular 
momentum initially given to the solid. What is the final state of helium? We 
ask then for the lowest state of this rotating helium. 

For a system of given angular momentum, the kinetic energy is the least if 
the angular velocity w is a constant throughout the liquid. However, this motion 
is not rotation-free, because V x V = 2m, and a high energy (see Eq. 11.41) is 
required in order to set any small part of liquid helium into a rotational state. 
If we consider the helium to be a rigid body rotating, we get a reasonable energy 
for the rotation, but we cannot consider helium to be rigid. The energy to clamp 
the atoms into a rigid relationship with one another is very high, and must be 
taken into account if one wishes to calculate the energy for uniform rotation, 
with V x V, # O everywhere. A simple way to check the validity of the above 
reasoning is to consider two atoms of mass m in a harmonic potential. The 
energy of excitation of one of the atoms is 


But if the atoms could be rigidly held together, that energy would become 
h(k/2m)'?, which apparently would mean that the presence of a second atom 
allows a lower energy of excitation. This paradox is resolved by noting that it 
takes energy to hold the atoms together. If only a limited energy is available, 
nearly all the parts of the fluid must be frozen out in their ground states. That 
is, nearly everywhere the local angular momentum is zero, so that, V x V = 0 
except at certain discontinuities. Suppose we assume that the angular momentum 
is carried by excitations. That is, the liquid is not free of excitations such as rotons 


Fig. 11.25 Rotating can of helium. 


*R. P. Feynman, chapter If in vol. I Progress in Low Temperature Physics, edited 
by C. J. Gorter (1955). 
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and phonons even when the temperature is O°K. The number and type of 
excitations is to be found by minimizing the energy for the given value of angular 
momentum. This energy, for vessels of centimeter dimensions turning at about 
one radian per second, turns out to be nearly 10* times the energy of a rigid 
body rotating at the same angular velocity. We must reject such a model. 

Thus, in order to achieve a state of lower energy for the given angular 
momentum, we examine velocity fields that are not everywhere continuous. 
As we saw in Section 11.7, we know that if there is a hole at the center of the 
liquid, circulation can exist. Thus a solution is suggested: the liquid forms a 
vortex around a hole with constant circulation as analyzed in Section 11.7. 
The velocity varies inversely as the radius, rising to such heights near the center 
as to be able to maintain the hole free of liquid by centrifugal force. The energy 
for this state is still quite a bit higher than the kinetic energy for the rigid body 
case, because the velocity instead of being distributed proportionally to the 
radius (the solid case), actually falls as the radius increases (see Eq. 11.39). 
Nevertheless it is orders of magnitude below the continuous V-field with excita- 
tions as suggested above. 

Once we admit the possibility that V is not necessarily continuous we can 
think of a state of lower energy. Suppose that the liquid has not only one vortex 
at the center, but several vortices. For example, suppose beside the central one 
there were a number distributed about the circle of radius R/2, half that of the 
vessel R, and all turning the same way as shown in Fig. 11.26. If the number of 
vortices at R/2 is large, this state is like a vortex sheet so the tangential velocity 
can jump as we pass from inside R/2 to outside as shown in Fig. 11.27, The 
gain in energy resulting from this improved distribution may more than com- 
pensate for the energy needed to make the additional holes (and, further, the 
central vortex need not now be so large and energetic). 

Continuing in this way with ever more vortices we soon notice that the 
energy can always be reduced if more vortices form. However, there is a limit. 
Due to the quantization of the vortex strength, the smallest vortex has cir- 
culation 2xh/m. The lowest energy results ifa large number of minimum-strength 
vortex lines (which we will call ‘unit lines”) form throughout the fluid at nearly 


Fig. 11.26 The vessel as seen from above, with vortices all turning the same way. 


11.9 A reasoning leading to yortex lines 339 


Tangential 
velocity 


aaa * Radial distance 
R/2 


Fig. 11.27 Discontinuity in tangential velocity at R/2. 


uniform density. The lines are all parallel to the axis of rotation. Since the curl 
of the velocity is the circulation per unit area, and the curl is 2m, there will 
be (by Stokes’s theorem) 

m 


2m —- = 2.1 x 10° lines/em? (11.42) 
2nh 


with @ in radians/second. For @m = | radians/second the lines are about 
0.2 mm apart, so that the velocity distribution is practically uniform. 

The energy associated with one vortex line is derived from Eq. (11.41). 
Taking a = 4.0 A and 6 = 0.2 mm, and using Eq. (11.42) we find 


Total line energy per unit volume = pow In (’) = 14p mh. (11.43) 
a 


Thus the total line energy in the rotating liquid of radius R and a unit thickness 
is 14p9m@hxR?. The kinetic energy for a rigid rotating body of radius R and 
unit thickness is mmp)R*w7/4. The ratio of the two is of order 


4-" x 14 = 107? (11.44) 
R*w 


(the number 107? is calculated for R = | cm and w = | rad/sec). For macro- 
scopic laboratory dimensions the excess energy to form the lines is small. 
They would form if rotating solid helium is melted by releasing the pressure, the 
angular velocity distribution would differ imperceptibly from uniformity, and 
the surface should appear parabolic. 


11.9 A REASONING LEADING TO VORTEX LINES 


Suppose liquid helium is separated into an upper and a lower part by a friction- 
less surface, and the upper part is moving with velocity V (see Fig. 11.28). 
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Fig. 11.28 A frictionless surface separating moving liquid helium from stationary 
liquid helium. 
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Fig. 11.29 Discontinuities at the dividing surface. 
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Fig. 11.30 Vortex lines. 


In the upper part the wave function contains the factor 
exp @ vey Ri). 


When the dividing surface is removed there appears the surface free energy or 
the surface tension. Because at periodic points exp(im/h)vR = 1, there is no surface 
tension at these points. The dividing surface is pictured in Fig. 11.29: around a 
discontinuity there is a circulation. Now you may ask, what is the best size of 
the slot for discontinuity? A calculation shows that the discontinuity is not the 
shape of a slot, but a line, so that the dividing surface would look like Fig. 11.30. 
Thus, this reasoning may lead to the assumption of the existence of vortex lines. 

Consider what would happen if liquid is flowing out of an orifice into a 
reservoir of fluid at rest. If the flow is irrotational, it looks like Fig. 11.31. A 
very high velocity develops near the corners and there are large accelerations 
there. An ordinary fluid, such as water, flows in a more complicated manner. 
For a rough estimate, let us suppose that the tube is a long narrow slot, and 
suppose the fluid tries to go out in a jet, at first of the same width and velocity 
as in the tube. Then outside this tube we have a situation similar to that shown 
in Fig. 11.28 and following the same reasoning as before, we expect to see the 
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Fig. 11.32 Vortex lines outside an orifice. 


vortex lines shown in Fig. 11.30 or in Fig. 11.32. Integrating over the dotted 


line we find 
V, = a = nm). 
m 


where n is the number of vortex lines in length |. The number of vortices per 
unit length along the line of flow is 
is daments (11.45) 
(2nh/m)  2nh 


These vortex lines have an associated energy. From Eq. (11.41) the energy 
per unit length of the vortex line may be written as 


2 
~ 1p : In (‘) , (11.46) 


m a 
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where p is the density of liquid helium, d is the width of the jet in Fig. 11.32, 
and a is the spacing between atoms, roughly 4 A as was stated at the end of 
Section 11.7. The vortices must move out at velocity V/2, the velocity at the 
location of the vortices. Thus the energy of vortices created per unit time per 
unit length of slot is 


2: 2 
re ei ps. gi Car itl. cata 
2 2nh m a 


In order to form the vortex lines, the kinetic energy of the jet per unit slot 
length must be greater than Eq. (11.47).. The kinetic energy of flow coming 
out of the orifice per unit time per unit slot length is 


2 


Bo Vd. (11.48) 


The critical velocity is obtained by equating Eq. (11.47) and Eq. (11.48): 
jy apt eae (‘) (11.49) 


dm a 


For velocity greater than vo, there is enough energy in the flow to create vortices. 
To get a rough idea, if we use d = 107° cm, so that In (d/a) = 6, we see 


Vo = 100 cm/sec. (11.50) 


The experimental value is about 20 cm/sec. The difference suggests that the 
model we have used is oversimplified. 

There is a connection between the concepts of vortex lines and rotons. 
Starting from the vortex lines as shown in Fig. 11.32 we see that they will 
gradually change into rotons as shown in Fig. 11.33. A vortex ring can be 
broken into two smaller rings. A roton may be regarded as a small vortex ring, as 
discussed in Section 11.5, 

There are experimental verifications of vortex lines. See, for instance, 
W. F. Vinen and H. E. Hall, Proc. Roy. Soc. 238, 204 (1956); 238, 215 (1956): 
H.E. Hall, Proc. Roy. Soc. 245, 546 (1958): W. F. Vinen, Nature 181, 1524 (1958). 
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Fig. 11.33 Vortex lines changing into rotons. 


11.10 The 4 transition in liquid helium 343 


11.10 THE 4 TRANSITION IN LIQUID HELIUM 


When we know the curve for E(p), the energy versus momentum function, then 
the partition function and the specific heat can be calculated if we assume each 
excitation is independent. This calculation leads to the curve of the specific 
heat, which agrees with the experiment for the part drawn in solid in Fig. 11.34. 

The above approach, however, does not explain the existence of the transi- 
tion shown by the dotted curves in Fig. 11.34, for the interaction between 
excitations is not taken into account. 

For the region near the transition point the E(p) curve in Fig. 11.34 is not 
of much help. We start afresh and use a different method. 

In terms of path integrals we have 


bp a if exo(-} [" » gt de «| iu) 


xi(0)= xi 
Xi(U) = px; 
Bx, Gxy++* Dey dx,+ ++ dxy (11.51) 
As we concluded in Section 4.1 for high enough temperatures only the identity 
permutation counts, and we get approximately the classical partition function 
of helium. In this case, “thigh enough temperatures” means (1/A) [§ (mx?/2) 
is large for x(0) = position of one atom, x(U) = position of another atom. If 
d is the average interatomic distance, x is'on the average d/u and (1/h) |$ (mx?/2) 
is of order mkTd?/2h?. For T » 2°K, 


and permutations cannot be important in Eq. (11.51). For low T (high U) 
permutations are important. Furthermore, the approximation that 
V[x,{u) — x,(u)] is independent of u for important paths is true only if U is 
small, so that x(u) ~ x(0) to have an appreciable contribution to the path 


- 7 


oO 


Fig. 11.34 Specific heat of liquid helium; theory and experiment agree for the part 
shown as a solid line. 
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integral. For low temperatures, x(w) can vary greatly without giving rise to a 
large value of X?(u), and thus V(x; — x,;) can vary over the path. 

To calculate the A transition, we must concern ourselves with what happens 
at low temperatures. We can visualize each path of the integral as a motion of 
every particle of the liquid from one position to another. ‘‘v’” is thought of as 
time, and x,u) is the position of the ith particle of ‘‘time” u. If, during this 
motion, x(u) becomes too large, or if V becomes too large, the path gives only a 
small contribution, V[xu) — x,(u)] is large if at “time” w particles i and j are 
close to each other. Of course we are not claiming that x,(u) is really the position 
of some atom at an actual time. But thinking in the manner described above 
allows one to apply physical intuition effectively. 

Consider a permutation in which a particle at position A moves to the 
position B, while the particle at B moves to a third position. (See Fig. 11.35.) 
We will consider the contribution to (1/4) ¥ | (mx?/2) caused by A. 

As A moves to position B, it must somehow avoid passing through particles 
such as C. To get to position Bin time U, particle 4 must move quickly enough 
to cover the distance. Since x is large for particle A, it is increased a large 
amount by increments of velocity so A does not have time to avoid C by trying 
to move around it. Instead, C is jostled out of the way. Suppose, now, that the 
distance from A to Bis r, and suppose that there are n particles that have to be 
pushed out of the way. 7 is of the order of r/d ,where d is the interatomic spacing. 
Each particle along the path has to move a distance of order d’ to avoid being 
too close to particle 4. Particle A is close to particle C for a time of order u/n, 
so that particle C must move with velocity d’/(u/n) during that time, giving 


contribution 
PN 12 u 2 
m d u_ md'*n ‘is 1 y mxi 
_ 2h ujn}] n 2hu hjoT 2 


Fig. 11.35 Particle at 4 moves to B while particle at B moves elsewhere. 
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Thus the v particles give contribution 
md’*n? __ m [(d'\ r? 
2hu = 2h? @ B 
Particle 4 itself contributes mr?/2h7B. Thus the effect of the potential is to 
change e7 "7/24? to e~(—ym7/2B) where y = 1 + (d'/d)?. 

We are going to proceed as if y were independent of r and f, and our 
reasoning above shows that this assumption is not too unreasonable. But 
remember that for displacements of order d we may not need as much adjust- 
ment, so y may be less if r is small. Also if the velocity is especially high, it may 
be preferable to pass through relatively high potentials, rather than have rapid 


adjustments. Thus again y would be lowered. Let us ignore such details. 
It is convenient to define the effective mass m’ as 


t 


my = 


so that we may write e~”/?""4r?_ Thus in the liquid state, so far as the path 
integral is concerned, a particle behaves like an ideal-gas particle* of mass m’. 
We have arrived at the important conclusion that the interaction among 
particles is ‘‘kinetic’’ rather than “potential.” Kinetic interaction varies as 
r?/B, while potential varies as rB (as can be seen by estimating the potential 
contribution for paths that do not avoid high potentials). 
Therefore the partition function for low temperatures has the form 


pin: m’ \3N/2 
N! © } \2nhp 


x xp (- 7p - PR?) (Ri, Ros» Re) d°R, d?R,-°° 
(11.52) 


The last factor comes from the potential-energy contribution of the initial 
configuration. The reason for the factor (m’/2h?B)°*’? is not obvious, for it is 
not obvious how a change in the effective mass in the path integral will change 
the normalization of the integral. To be complete, we should instead write a 
factor K,(m'/2xh7p)°%’?. For simplicity, we will pay no attention to K,. The 
factor p is clearly small for initial configurations (equal to the final configurations) 
such that atoms overlap. We can, for example, expect results that are 
qualitatively correct by choosing p = 0 if any two R’s are closer than b (with 
““b” roughly 2.6 A) and p = 1 otherwise. 

A permutation among particles can be visualized as a polygon with arrows. 
The example shown in Fig. 11.36 consists of several cycles of length 1 (particles 


* The above results apply only to Bose particles. Fermi particles behave differently. 
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Fig. 11.36 Permutation among particles; here there is one cycle of length 2 and two 
of length 3. 


that are not permutated), one cycle of length 2, and two cycles of length 3. 
Thus, the sum 3°, in Eq. (11.45) is a sum over all possible polygon patterns. 

A way to estimate Eq. (11.45) for temperatures near the transition tem- 
perature will be explained here.* For such temperatures, a side (of a polygon) 
longer than the average interatomic distance d is not important. Thus we may 
assume that the sides of all polygons that contribute have length of order d. 

Given a particular configuration (R,, R,,..., Ry) we restrict our permuta- 
tions to be such that R; and PR; are close. In order to get a contribution to the 
integral in Eq. (11.45), p forces the atoms to be roughly uniformly distributed. 
Consider the sum 


Zew| ~ F 


Large shifts in the R; that preserve the uniform distribution effectively permute 
the atoms and thus leave the above sum unchanged, except for the change that 
would result from small shifts in the R;. The effect of smaller shifts in the R; 
that do not permute the atoms can be included by replacing R; — PR, with an 
average interatomic distance, d. For all uniform distributions of atoms, we can 
therefore replace 


TR, ~ Pry]. 


2hB 4 


x exp | a 2p? (Ri - PR) | 
by 
Pp 2h B 


YY exp iC Ae) ; 


* For a more complete discussion, see R. P. Feynman, Phys. Rev. 91, 1291 (1953). 
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where >°,, is the sum over all permutations of a fixed lattice such that the resul- 
ting polygons are made up of lines joining near neighbors, and n(p) is the total 
number of sides in the set of polygons defined by P. We can now take the above 
sum out of the integral in Eq. (11.45) and write 


, 


m 3N/2 13 ; 
e *®F — constant x — x s em ant p)/2hB 
2nh?B : 


In this equation, the difficult problem is the calculation of the sum. So ignore 
the first two factors on the right and set y = e~"#/2"’8_ We must compute 


eT PF = HY yn), 
p 


So we have the following problem: Given a lattice of points, lines can form 
between adjacent points to form polygons. With each line is associated an 
energy (m’d?/2h’ B). We wish to calculate the free energy of this system. Let 7, 
be the number of polygons with s sides and let C(n,, n,,...) be the number of 
permutations with n, polygons of s sides. Then 


e FF = om C(ny, me) y sn. 


My M2... 
This sum is restricted by the equation 

fo 6) 

¥ sn, = N = number of atoms. 

The restriction on the sum can be eliminated by the usual method of letting 
N vary while assigning to each atom a weighting factor, t (in Chapter 1, ‘‘t”’ 


was e?), 


Q=e% 


»» Cor, ma 9S m)e(S mn) 


NYN2, 26. 


Sy ores, (11.53) 
N 


From Eq. (11.46) it is easy to see that the expectation value of N is 


Cus a SEO 2 ag (11.54) 
dt dt 

To evaluate Eq. (11.46), we must find a way of approximating C(,, 2,...). 

We will approximate C by writing 

Rv 


C(my, ny...) = 2 
Ss n,! 
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where R is the number of ways of drawing a polygon with s vertices on a lattice 
of (N) points (with all sides joining nearest neighbors). This approximation is 
based on the following assumptions: 


a. The number of ways of drawing a polygon with s sides is roughly independent 
of the polygons that have already been drawn. The effect of the competition 
between polygons for available atoms is taken into account by the factor 


be 99 


t 

b. If we allow overlapping of polygons there will not be much error. 

ce. n, small enough so that given n, random polygons with s sides, there prob- 
ably will not be any that are on top of each other. 


Then we can choose n, polygons of length s in about R"/n,! ways. By 
assumption (a), the total number of ways of choosing sets of polygons with 7, 
polygons with s sides is 

Ris 
Equation (11.46) becomes 


= exp [R,1] exp | Roe 
or 
g = —kT & + : Ron], (11.55a) 
t is chosen so that 
CN) = Ryt + > sR yt)’. (11.56a) 


We now must find R,. If s = 1, we wish to find R,, the number of ways of 
picking one point out of (NV), so R, = (N). Hereafter, write (N) = N. In 
general, to find R, we let 4, equal the number of polygons with s sides that can 
be drawn starting at a given atom. Then NA, would be the number of ways of 
drawing a polygon with s vertices if we started at any atom; but we have counted 
each polygon s times. Thus R, = Nh,/s. To avoid worrying about what happens 
when we get near the boundary, we simply consider the system to have periodic 
boundary conditions. That is, in two dimensions the system would have the 
topology of a torus. We will now approximate A, for fairly large s assuming 
that the temperatures are such that small s does not contribute much to the 
sums in Eq. (11.55a) and Eq. (11.56a). Suppose / is the number of nearest 
neighbors of each lattice point. Consider the problem of random walk on the 
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lattice. There are /* equally probable random walks that start on a given atom 
and make s steps. In h,, we wish to consider only those ways in which the walks 
terminate at the origin. The fraction of walks that end up at the origin is in- 
versely proportional to the volume of space in which the walk is likely to end. 
Since the distance from the origin of the typical random walk is of the order 
dys (d = length of each step), the volume in which a typical walk ends is 
proportional to s*/?,. Thus A, is proportional to /S/s?/? for large s, where the 
proportionality constant is independent of s. Equations (11.55a) and (11.56a) 
become 


Ks) 
| 
| 
2 
‘a 
~ 
a 
+ 
i) 
M 
cy 
<< 
= 


(11.55b) 


+ 
OQ 
p48 
oy 
a 
“sd 
= 
— 


(11.56b) 


where c is a proportionality constant. For extremely large s, our result above 
for h, is incorrect. If one chain covers a volume comparable to the size of the 
whole system, then A, = /*/N, where 1/N is the probability that a random walk 
will end at the original point (or any other particular point). Then a more 


realistic h, might be 
h, © S z x) 1S, 
$3/2 N 


which has the correct behavior for medium and large s. In any case, the sums in 
Egs. (11.55a) and (11.56a) converge for ty] < I and diverge for ty/ > 1. At 
reasonably high temperatures y is small and the sum converges. Also, at such 
temperatures there are not too many polygons, so assumptions (a), (b), and (c) 
are not too unrealistic. Thus we expect our results to be qualitatively correct 
above the 4 point. Equations (11.55b) and (11.56b) resemble the results for a 
Bose gas (see Section 1.8) and lead to an increase in specific heat with decreasing 
temperature. The shape of the specific heat curve is shown in Fig. 11.37. 
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Fig. 11.37 Specific heat near the lambda point. 
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This analysis (which is supposed to be good for T > T,) thus explains why 
C increases as T approaches T, from above. This treatment does not lead to the 
discontinuity of the kind shown in Fig. 11.34, probably because of the approx- 
imations made between Eqs. (11.52) and (11.55a). In other words, we expect 
that the volume of the helium atoms described by p in Eq. (11.52) is the cause 
of the discontinuity in the specific heat. Kikuchi and others have written papers 
in which it is indicated that more accurate computations starting from Eq. 
(11.52) would lead to a discontinuity.* 


Problem: How do these polygon calculations differ from the ones we did 
in Chapter 5 for the Onsager problem? 


* Kikuchi, R., Phys. Rev. 96, 563 (1954). Kikuchi, R., Denman, H. H., and Schrei- 
ber, C. L., Phys. Rev. 119, 1823 (1960). 
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